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DIAGNOSIS OF KIDNEY DAMAGE AND PROTECTION AGAINST SAME 

BACKGRODND OF THE INVENTION 

Field of the Invention 
5 The invention relates to various nucleic acid molecules 

and proteins, and their use in (1) diagnosing kidney damage, 
or conditions associated with the development of kidney 
damage, and (2) protecting maimnals (including humans) 
against kidney damage - 

10 

Description of the Background Art 

Kidney Anatomy and Physiology 

In humans, the kidneys are the main excretory organs, 

eliminating urea, citric acid, creatinine and other waste 
15 metabolites. The kidneys also conserve or excrete water and 

electrolytes as required. Finally, they produce the hormone 

erythropoietin as well .as other proteins. 

The kidneys are paired organs situated on the posterior 

wall of the adrenal cavity.. The cortex is the outer layer. 
20 The medulla consists of multiple triangular renal pyramids 

whose bases are covered by the cortex and whose tips 
(papillae) project into the minor calyces of the renal 

pelvis. Within the renal pelvis, the minor calyces join to 

form a major calyx, and the major calyces join to form the 
25 renal pelvis, the expanded upper end of the ureter. 

The functional unit of the kidney, where urine is 

formed, is the nephron. Each nephron consists of (1) a knot 

of coiled capillaries called a glomerulus, and (2) a tubule. 

The proximal end of the tubule forms a cup, the glomerular 
30 capsule, which surrounds the glomerulus. The glomerular 

capsule and glomerulus together form the renal corpuscle. 

The renal corpuscles are located in the cortical region of 

the kidney. Moving away from the glomerular capsule, the 

nephron has, in order (1) the proximal convoluted tubule, 
35 (2) the loop of Henle, and (3) the distal convoluted tubule. 

The distal convoluted tubules of several nephrons empty 

into a collecting tubule, and 10-25 collecting tubules empty 

into each renal pyramid. 



wo 2004/013289 




PCT/US2003/024053 



Blood passes from the aorta to the renal artery^ which 
divides into interlobar arteries (in the renal colximns of 
the cortex/ which separate the pyramids) , then into arcuate 
arteries parallel to the surface of the kidney^ then into 
5 interlobular arteries, and then into afferent arterioles, 
each of which supplies a renal corpuscle and forms a 
capillary network called a glomerulus. The blood* leaves 
through an efferent arteriole that divides into peritubular 
capillaries surrounding the tubules; these capillaries 

10 converge into interlobular veins, which feed arcuate veins, 
which empty into interlobar veins, and then connect to the 
renal vein, which in turn feeds the inferior vena cava. 
There thus are two capillary beds, the glomerulus and the 
peritubular capillaries . 

15 The glomeruli and the renal tubules (nephrons and 

collecting tubules) participate in glomerular filtration, 
tubular reabsorption, and tubular secretion. 

Glomerulosclerosis 

20 Glomerulosclerosis refers to a scarring of the 

glomeruli. The ^^scarring'' is caused by deposition of 
cellular basement membrane proteins such as collagen IV. 
The scarring disrupts the filtering process of the kidneys, 
allowing protein to leak from the blood into the urine 

25 (proteinurea) . About 15% of people with proteinurea are 
diagnosed, usually by kidney biopsy, as having 
glomerulosclerosis. The scarred glomeruli cannot be 
repaired. 

30 End Stage Renal Disease 

Many patients with glomerulosclerosis gradually get 
worse until their kidneys fail completely, a condition known 
as end stage renal disease (ESRD) . Patients with ESRD must 
go on dialysis or receive a kidney transplant (a new 
35 kidney) • The progression of kidney failure can be slowed 
using ACE inhibitors . 

Each year, in the United States, nearly 80,000 people 
suffer kidney failure. In 1998, .the causes of kidney 
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failure were diabetes ' (43 .2%) , high blood pressure (23.0%)/ 
glomerulonephritis (12.3%), polycystic kidney disease 
(2.9%), or other (18.6%). 

5 Diabetes 

Diabetes mellitus is a pleiotropic disease of great 
complexity. The two major types have been termed type I or 
insulin-dependent diabetes mellitus (IDDM) and type II or 
non-insulin-dependent diabetes mellitus (NIDDM) . Type II 

10 diabetes is the predominant form found in the Western world; 
fewer than 8% of diabetic Americans have the type I disease. 
Development of type II diabetes may be linked to a more 
sedentary lifestyle and obesity. 

Type I diabetics are often characterized by their low 

15 or absent levels of circulating endogenous insulin (Unger 

and Fpster, 1998). Islet cell antibodies causing damage to 
the pancreas are frequently present at diagnosis. Injection 
of exogenous insulin is required to prevent ketosis and 
sustain life. Type II diabetics are often characterized by 

20 hyperinsulinemia and an increasing resistance to insulin. 

They are usually not insulin dependent or prone to ketosis 
under normal circumstances. Hyperglycemia can usually be 
controlled by an alteration in diet and amount of exercise, 
but insulin treatment may be required. 

25 Complications of diabetes (end organ damage) include 

retinopathy, neuropathy, and nephropathy (traditionally 
designated as microvascular complications) as well as 
atherosclerosis (a macrovascular complication) . It has been 
shown that meticulous blood glucose control can of ten . plow 

30 down or halt the progression of diabetic complications if 
caught early enough (Unger and Foster, 1998) . However, 
tight metabolic control is extremely difficult to achieve. 

Diabetic Nephropathy 
35 Diabetic nephropathy is the leading cause of end stage 

renal disease. About 20 million people in the United States 
have diabetes, and about 100,000 people have kidney failure 
as a result of diabetes. 
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Initial nephromegaly as well as glomerular hypertrophy and 
microalbuminuria, once thought to be limited to type I 
diabetics, are now seen equally in type II diabetics (Wirta 
et al., 1996). The pathology of diabetic nephropathy begins 
with thickening of the glomerular basement membrane (GBM) , 
an increase in mesangial matrix, and subintimal hyaline 
thickening of both afferent and efferent arterioles (Unger 
and Foster, 1998) . The capacity td clear macromolecules is 
impaired in diabetes, resulting in accumulation of albumin 
and larger proteins within the glomerular wall and in the 
mesangium that may lead to stimulation of mesangial matrix 
production. 

The progression of kidney failure occurs in five stages 
(Wardle, 1996): (I) hyperf iltration (increased blood flow) 
and enlargement ' of the kidneys (11) glomeruli begin to show 
damage, e.g., by microalbuminuria (20-200 ug/min loss of 
albumin); (III) dipstick-positive proteinurea (>200 lag/min) , 
blood levels of creatinine and urea-nitrogen use, (IV) 
glomerular filtration decreases to less than 75 ml/min, and 
(V) kidney failure: filtration rate less than lOml/min-. 

Features of the first stage include an increase in 
kidney volume (renal hypertrophy) and function {increased 
glomerular filtration rate or GFR) . Features of the second 
stage include a thickened GBM. Features of the third stage 
include detection of microalbuminuria. Once this stage is 
reached, there is little chance for reversal and the disease 
progresses on to the fourth stage (glomerulosclerosis and 
macroalbuminuria) and the fifth stage (ESRD) . 



Other Kidney Disorders 

Glomerulonephritis is an inf lamination of the glomeruli, 
often due to an allergic reaction to streptococcal toxins. 

Pyelonephritis is a bacterial infection of the kidney 
pelvis and surrounding tissues. 

Kidney stones may be formed in the renal pelvis. 



Growth Hormone 

Growth hormone has many roles, ranging from regulation 
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of protein, fat and carbohydrate metabolism to growth 
promotion. GH is produced in the somatrophic cells of the 
anterior pituitary and exerts its effects either through the 
GH-induced action of IGF-I, in the case of growth promotion, 
5 or by direct interaction with the GHR on target cells 
including liver, muscle, adipose, and kidney cells. 
Hypos ecretion of GH during development leads to dwarfism, 
and hypersecretion before puberty leads to gigantism. In 
adults, hypersecretion of GH results in acromegaly, a 

10 clinical condition characterized by enlarged facial bones, 
hands, feet, fatigue and an increase in weight. Of those 
individuals with acromegaly, 25% develop type 2 diabetes. 
This may be due to insulin resistance caused by the high 
circulating levels of GH leading to high circulating levels 

15 of insulin (Kopchick et al.. Annual Rev. Nutrition 1999. 
19:437-61). . 

A further mode of GH action may be through the 
transcriptional regulation of a number of genes contributing 
to the physiological effects of GH. 

20 

Diabetic Glomerulosclerosis in Animal Models 

One hallmark of kidney damage is. the development of 
glomerulosclerosis. Several mouse models develop 
glomerulosclerosis similar to that seen in human diabetics. 

25 Streptozotocin (STZ) induced diabetic mice, as well as the 
spontaneously occurring non-obese diabetic mice, develop 
glomerulosclerosis . 

Interestingly, bovine growth hormone (bGH) expressing 
transgenic mice, which are not diabetic but are 

30 hyper insulinemic, develop severe glomerulosclerosis. In 
contrast, both bGH antagonist transgene mice and GH 
receptor/binding protein (GHR/BP) gene-disrupted homozygous 
(-/-) mice are protected from kidney damage, even when 
diabetes is induced by STZ treatment. 

35 McGrane, et al., J. Biol. Chem. 263:11443-51 (1988) and 

Chen, et al., J. Biol. Chem., 269:15892-7 (1994) describe 
the genetic engineering of mice to express bovine growth 
hormone (bGH) or hiaman growth hormone (hGH) , respectively. 
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These mice exhibited an enhanced growth phenotype- They 
also developed kidney lesions similar to those seen in 
diabetic glomerulosclerosis, see Yang, et al . , Lab. Invests, 
68:62-70 (1993). Ogueta, et al., J. Endocrinol., 165: 321-8 

5 (2000) reported that transgenic mice expressing bovine GH 
develop arthritic disorder and self-antibodies. 

Growth hormone genes and the proteins encoded by them " 
can be converted into growth hormone antagonists by 
mutation, see Kopchick USP 5,350,836. Transgenic mice have 

10 ' been made that express the GH antagonists bGH-G119R or hGH 
G120R, and which exhibit a dwarf phenotype. Chen, et al-, 
J. Biol. Chem., 263:15892-7 (1994); Chen, et al., Mol. 
Endocrinol, 5:1845-52 (1991); Chen, et al., Proc. Nat. Acad. 
Sci. USA 87:5061-5 (1990). These mice did not develop 

15 kidney lesions. See Yang (1993), supra. 

Chen, et al.|. Endocrinol, 136:660-7 (1995) compared the 
effect of streptozotocin treatment in normal nontransgenic 
mice, and in mice transgenic for (1) a GH receptor 
antagonist, the G119R mutant of bovine growth hormone or (2) 

20 the E117L-mutant of bGH. (According to Chen's ref. 24, 
these large GH transgenic streptozotocin-treated mice 
constitute an animal model for diabetes.) 
Glomerulosclerosis was seen in diabetic (STZ-treated) 
nontransgenic mice and in diabetic bGH-E117L mice, but not 

25 in diabetic bGH-G119R mice. 

Two of the proteins which mediate growth hormone 
activity are the growth hormone receptor and the growth 
hormone binding protein, encoded by the same gene in 
mice(GHR/BP) . It is possible to genetically engineer mice 

30 so that the gene encoding these proteins is disrupted 

(^^knocked-out"; inactivated), see Zhou, et al., Proc. Nat. 
Acad. Sci. (USA), 94:13215-20 (1997). Zhou, et al. 
inactivated the GHR/BP gene by replacing the 3* portion of 
exon 4 (which encodes a portion of the GH binding domains) 

35 and the 5' region of intron 4 with a neo gene cassette. The 
modified gene was introduced into the target mice by 
homologous recombination. Like mice expressing a GH 
antagonist,, homozygous GHR/BP-KO mice exhibit a dwarf 
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phenotype. GHR/BP-KO mice, made diabetic by streptozotocin 
treatment, are protected from the development of diabetes- 
associated nephropathy. Bellush, et al., Endocrinol., 
141:163-8 (2000). 



Differential/SubtractivB Hybridization 

Zhang, et al.. Kidney International, 56:549-558 (1999) 
identified genes up-regulated in 5/6 nephrectomized 
(subtotal renal ablation) mouse kidney by a PCR-based 
subtraction method. Ten known and nine novel genes were 
identified. The ultimate goal was to identify genes 
involved in glomerular hyper filtration and hypertrophy. 

Melia, et al . Endocrinol., 139:688-95 (1998) applied 
subtractive hybridization methods for the identification of 
androgen-reguiated genes in mouse kidney. The treatment 
mice were dosed .with dihydrotestosterone, an androgen. 
Kidney androgen-reguiated protein gene was used as a 
positive control, as it is known to be up-regulated by DHT. 

Holland, et al . , Abstract 607, "Identification of Genes 
Possibly Involved in Nephropathy of Bovine Growth Hormone 
Transgenic Mice" (Endocrine Society Meeting, June 22, 2000) 
describes the screening strategy of Example 3, but does not 
disclose the identity of any of the differentially expressed 
clones . 

Coschigano, et al.. Abstract 333, "Identification of 
Genes Potentially Involved in Kidney Protection During 
Diabetes" (Endocrine Society Meeting, June 22, 2000) 
describes the screening strategy of Example 1, but does not 
disclose the identity of any of the differentially expressed 
clones . 

The following differential hybridization articles may 
also be of interest: 

Wada, et al., "Gene expression profile in 
streptozotocin-induced diabetic mice kidneys undergoing 
glomerulosclerosis". Kidney Int, 59:1363-73 (2001); 

Song, et al., "Cloning of a novel gene in the human 
kidney homologous to rat muncl3S: its potential role in 
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diabetic nephropathy". Kidney Int., 53:1689-95 (1998); 

Page, et al., "'"Isolation of diabetes-associated kidney 
genes using differential display", Biochem. Biophys . Res. 
Comm., 232:49-53 (1997). 

Peradi, ^'Subtractive hybridization claims: An efficient 
technique to detect overexpressed mRNAs in diabetic 
nephropathy," Kidney Int. 53:926-31 (1998). 

Condorelli, EMBO J. , 17:3858-66 (1998). 

See also WOOO/6.6784 (differential hybridization 
screening for brown adipose tissue); PCT/USOO/12366, filed 
May 5, 2000 (differential hybridization screening for 
liver) . 

Idantxfiaation of gen&a involved in nephropathy 

Attention recently has focused on the generation of 
kidney mRNA , expression profiles and the identification of 
genes involved in nephropathy. Initial studies examined 
expression in the kidney of known genes, such as allV 
collagen, laminin Bl, TGF-Bl, 72 kDa collagenase, and TIMP- 
3, which are involved in extracellular matrix synthesis and 
degradation (Esposito et al., 19967 Striker et al . , 1996; 
Yang et al., 1997; Yang et al., 1994). Changes in 
facilitative glucose transporter, receptors for advanced 
glycation end products (RAGE) , and TGFB isoform and receptor 
messages have also been examined (Chin et al., 1997; Hill et 
al., 2000; Ziyadeh et al . , 1997). With recent advances in 
gene profiling techniques, several databases docxamenting 
mRNA expression in the kidney as a whole as well as in 
individual cell types are being generated (Takenaka et al., 
1998; Virion et al., 1999; Wada et al., 2001). These 
techniques, as well as others, including differential 
display and subtractive hybridization, are being used to 
identify additional genes involved in the development and 
progression of diabetic nephropathy (Condorelli et al., 
1998; Page et al., 1997; Peraldi et al., 1998; Song et al., 
1998; Wada et al., 2001). 
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SUMMARY OF THE INVENTION 

Differential hybridization techniques have been used to 
identify mouse genes that are differentially expressed in 
mice, depending upon the extent or susceptibility to kidney 
5' damage. By identifying related human genes and proteins, 

one may identify agents useful in protecting humans against 
kidney damage. Protection against kidney damage mediated at 
least in part by growth hormone, and/or associated with 
diabetes, is of particular interest. 

10 Based on the differential expression of clones H8, HI, 

Fl, F2, F4, F5, F6, F27, F39, G38, G16, F16, B45, B3 and 
B4 6, we believe that the human proteins most closely 
corresponding to mouse vacuolar adenosine triphosphate, 
ubiquitin protein ligase Nedd-4, SON protein, FUSE binding 

15 protein 1, kidney androgen related protein, claudin 10, and 
heat shock protein 105 kDa (Hsp 105), a phosphotriesterase, 
certain immunoglobulins, and certain other proteins 
identified below, have a protective effect. 

Similarly, based on the differential expression of 

20 clones G26, C22, A8, A34, A39, A48, E39, F21, F38, F40, G9, 
G24 and G28, we believe that it would be desirable to 
inhibit the expression of the htiman proteins most closely 
corresponding to mouse 3-beta-hydroxysteroid dehydrogenase 
type 4, disable-2 p96 (Dab2) , palmitylated serine/threonine 

25 kinase, tumor dif f erentiallly expressed 1 (TDEl), cytochrome 
oxidase III, ERG2 protein and/or glutathione peroxidase 
3, certain immunoglobulins, and certain other proteins 
identified below, or to neutralize their activity, with a 
suitable antagonist (e.g., a binding antibody or 

30 . oligopeptide) . 

Thus, Applicants contemplate: 

(1) use of the ^^f avorable'' mouse DNAs of the Master 
Table (below) to isolate or identify related human DNAs; 

(2) use of human DNAs, related to favorable mouse DNAs, 
35 to express the corresponding human proteins; 

(3) use of the corresponding human proteins (and mouse 
proteins, if the sequence is sufficiently complete to be 
biologically active, and is active in humans), to protect 
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against kidney damage; and 

(4) use of the corresponding mouse or human proteins, 
in diagnostic agents, in assays to measure kidney damage or 
protection against such damage. 
5 Moreover Applicants contemplate: 

(1) use of the ^^unfavorable" mouse DNAs of the Master 
Table to isolate or identify related hviman DNAs; 

(2} use of the complement to the ^^unfavorable" mouse 
DNAs or related human DNAs, as antisense molecules to 
10 inhibit expression of the related human DNAs; 

(3) use of the mouse or human DNAs to express the 
corresponding mouse or human proteins; 

(4) use of the corresponding mouse or human proteins, 
in diagnostic agents, in assays to measure kidney damage; 

15 (5) use of the corresponding mouse or human proteins in 

assays to determine whether a substance binds to (and hence 
may neutralize) the protein; and 

(6) use of the neutralizing substance to protect 
against kidney damage. 

20 The related human DNAs may be Identified by comparing 

the mouse sequence (or its AA translation product) to known 
human DNAs (and their AA translation products) . If this is 
unsuccessful, human cDNA or genomic DNA libraries may be 
screened using the mouse DNA as a probe, 

25 Agents which bind the ^^favorable" and ^^unfavorable" 

nucleic acids, or the corresponding proteins (e.g., an 
antibody vs. the protein) may be used to screen for kidney 
damage . 
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BRIBF DESCRIPTION OF THE DRAWINGS 

Figure 1. Changes in glomerular volume (left panel) and 
urinary albumin excretion (rigrht panel) as a function of 
5 duration of STZ-induced diabetes in wild-type xoice. 

Glomerular measurements were made .from sections of kidney 
fixed in 4% paraformaldehyde' (Fig. lA) . Urinary albvimin 
excretion was determined by radioimmune assay (RIA) using a 
rat albumin antibody. Urinary volume output was normalized 
10 by creatinine concentration comparisons using a standard 

colorimetric assay (Fig. IB) . ND: nondiabetic; DB, diabetic. 

Figure 2. Physiological assessment of the progression of 
nephropathy in the kidney of NT and b6H female mice at 2, 5, 

15 and 12 months of age. Glomerular (Fig. 2A) and mesangial 
(Fig. 2BJ measurements were made from sections of kidney 
fixed in 4% paraformaldehyde. Urinary albumin excretion 
(Fig. 20) was determined by radioimmune assay (RIA) using a 
rat albumin antibody. Urinary volxime output was normalized 

20 by creatinine concentration comparisons using a standard 
colorimetric assay. 
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DETAILED DESCRIPTION OP THE PREFERRED EMBODIMENTS OP THE 
INVENTION 

Identified Differentially Expressed cDNAs 

5 In three differential expression studies (examples 1- 

3), we have obtained cDNAs which were differentially 
expressed. 

In Example 1, we looked ^t differential expression of 
genes in kidneys of streptozotocin-treated mice (a model of 
10 diabetes) as a result of GHR/BP knockout. The clones of 

interest included: G26, R8f lii/ ill/ £28/ G9, Gl£, G24, G28 
(favorables underlined) • 

In Example 2, we obtained data on differential 
15 expression of genes in kidneys of nontransgenic mice as a 
result of streptozotocin treatment. .The clones of interest 
included: Fl. F4, £5, £6, E39, FZr F16. F21, F38, £39, 

and F40- 

20 In Example 3, we studied differential expression of 

genes in kidneys of mice as a result of overproduction of 
growth hormone. The clones of interest included: A8^ A34^ 
A39, A48, B4^f C22^ B3, B46. 

25 

Direct and Indirect Utility of Identified Nucleic Add 
Sequences and Related Molecules 

The cDNAs of the disclosed clones may be used directly. 
For diagnostic or screening purposes, they (or specific . 

30 binding fragments thereof) may be labeled and used as 

hybridization probes. For therapeutic purposes, they (or 
specific binding fragments thereof) may be used as antisense 
reagents to inhibit the expression of the corresponding 
gene, or of a sufficiently homologous gene of another 

35 species. 

If the cDNA appears to be a full-length cDNA, that is, 
that it encodes an entire, functional protein, then it may 
be used in the expression of that protein. Such expression 
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may be in cell culture^ with the protein subsequently 
isolated and administered exogenously to subjects who would 
benefit therefrom^ or in vivo, i.e., administration by gene 
therapy. Naturally, any DNA encoding the same protein, or' a 
5 fragment or a mutant protein which retains the desired 
activity, may be used for the same purpose. The encoded 
protein of course has utility therapeutically and, in 
.labeled or immobilized form, diagnostically . 

The cDNAs of the disclosed clones may also be used 

10 indirectly, that is, to identify other useful DNAs, 
'proteins, or other molecules. We have attempted to 
determine whether the cDNAs disclosed herein have 
significant similarity to any known DNA, and whether, in any 
of the six possible combinations of reference frame and 

15 strand, they encode a protein similar to a known protein. 
If so, then it follows that the known protein, and DNAs 
encoding that protein, may be used in a similar manner. In 
addition, if the known protein is known to have additional 
homologues, then those homologous proteins, and DNAs 

20 encoding them, may be used in a similar manner. 

There thus are several ways that a human protein 
homologue of interest can be identified by database 
searching, including: 

25 

1) a DNA->DNA (BlastN) search for database DNAs closely 
related to the mouse cDNA clone identifies a particular 
mouse (or other nonhuman, e.g., rat) gene, and that nonhuman 
gene encodes a protein for which there is a known human-. 
30 protein homologue; 



2) a DNA->Protein (BlastX) search for database proteins 
closely related to the translated DNA of the mouse cDNA 
clone identifies a particular mouse (or other nonhuman) 
35 protein, and that nonhuman protein has a known human protein 
homologue; 
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3) a DNA->DNA (BlastN) search of the database for huunan DNAs 
closely related to the mouse cDNA clone identifies a 
particular human DNA as a homologue of the mouse cDNA^ and 
the corresponding human protein is known (e.g./ by 

5 translation of the human DNA) ; and 

4) a DNA->Protein (BlastX) search of the database for human 
proteins closely related to the translated DNA of the mouse 
cDNA clone identifies a particular human protein as a 

10 homologue of the corresponding mouse protein. 

Thus, if we have identified a mouse cDNA/ and it 
encodes a mouse protein which appears similar to a human 
protein, then that human protein may be used (especially in 

15 humans) for purposes analogous to the proposed use of the 
mouse protein in mice. Moreover, a specific binding 
fragment of an appropriate strand of the corresponding human 
gene or cDNA could be labeled and used as a hybridization 
probe (especially against samples of human mRNA or cDNA) . 

20 In determining whether the disclosed cDNAs have 

significant similarities to known DNAs (and their translated 
AA sequences to known proteins), one would generally use the 
disclosed cDNA as a query sequence in a search of a sequence 
database. The results of several such searches are set 

25 forth in the Examples. Such results are dependent, to some 
degree, on the search parameters. Preferred parameters are 
set forth in Example 1. The results are also dependent on 
the content of the database. While the raw similarity score 
of a particular target (database) sequence will not vary 

30 with content (as long as it remains in the database) , its 

informational value (in bits), expected value, and relative 
ranking can change. Generally speaking, the changes are 
small . 

35 It is possible to use the sequence of the entire cDNA 

insert to query the database. However, the error rate 
increases as a sequencing run progresses. Hence, it may be 
beneficial to search the database using a truncated 
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(presumably more accurate) sequence/ especially if the 
insert is quite long. 

It will be appreciated that the nucleic acid and 
protein databases keep growing. Hence a later search may 
5 identify high scoring target sequences which were not 

uncovered by an earlier search because the target sequences 
were, not previously part of a database. 

Hence, in a preferred embodiment, the cognate DNAs and 
proteins include not only those set forth in the examples , 
10 but those which would have been highly ranked (top ten, more 
preferably top three, even more preferably top two, most 
preferably the top one) in a search run with the same 
parameters on the date of filing of this application. 

15 If the cDNA appears to be a partial cDNA, it may be * 

used as a hybridization probe to isolate the full-length 
cDNA. If the partial cDNA encodes a biologically functional 
fragment of the cognate protein, it may be used in a manner 
similar to the full length cDNA, i.e., to produce the 

20 • functional fragment. 

If we have indicated . that an antagonist of a protein or 
other molecule is useful, then such an antagonist may be 
obtained by preparing a combinatorial library, as described 

25 below, of potential antagonists, and screening the library 
members for binding to the protein or other molecule in 
question. The binding members may then be further screened 
for the ability to antagonize the biological activity of the 
target. The antagonists may be used therapeutically, or, in 

30 suitably labeled or immobilized form, diagnostically . 

If the cDNA is related to a known protein, then 
sxibstances known to interact with that protein (e.g., 
agonists, antagonists, substrates, receptors, second 
messengers, regulators, and so forth) , and binding molecules 

35 which bind them, are also of utility. Such binding 
molecules can likewise be identified by screening a 
combinatorial library. 
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Isolation of Pull IiengtK cDNAs Using Partial cd^NAs as probes 

If it is determined that a cDNA of the present 
invention is a partial cDNA, and the cognate full length 
cDNA is not listed in a sequence database, the available 
5 cDNA may be used as a hybridization probe to isolate the 
full-length cDNA from a suitable cDNA library. 

Stringent hybridization conditions are appropriate, 
that is, conditions in which the hybridization temperature 
is 5-10 deg. C. below the Tm of the cDNA as a perfect 
.10. duplex. 

Identification and Zsolation of Homologous Genes/cDNAs Using 
a cDNA Probe 

It may be that the sequence databases available do not 

15 include the sequence of any homologous gene, or at least of 
.the homologous gene for a species of interest. However, 
given the cDNAs set forth above, one may readily obtain the 
homologous gene . 

The possession of one cDNA (the "starting DNA") 

20 greatly facilitates the isolation of homologous genes/cDNAs. 
If the clone in question only features a partial cDNA, this 
partial cDNA may first be used as a probe to isolate the 
.corresponding full length cDNA for the same species, and 
that the latter may be used as the starting DNA in the 

25 search for homologous genes. 

The starting DNA, or a fragment thereof, is used as a 
hybridization probe to screen a cDNA or genomic DNA library 
for clones containing inserts which encode either the entire 
homologous protein, or a recognizable fragment thereof. The 

30 minimum length of the hybridization probe is dictated by the 
need for specificity. If the size of the libra;:y in bases 
is L, and the GC content is 50%, then the probe should have 
a length of at least 1, where L = 4^. This will yield, on 
average, a single perfect match in random DNA of L bases. 

35 The human cDNA library is about 10^ bases and the human 
genomic DNA library is about 10^° bases. 

The library is preferably derived from an organism 
which is known, on biochemical evidence, to produce a 
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homologous protein, and more preferably from the genomic DNA 
or mRNA of cells of that organism which are likely to be 
relatively high producers of that protein. A cDNA library 
(which is derived from an mRNA library) is especially 
5 preferred. 

If the organism in question is known to have 
substantially different codon preferences from that of the 
organism whose relevant cDNA or genomic DNA is known, a 
synthetic hybridization probe may.be used which encodes the 

10 same amino acid sequence but whose codon utilization is more 
similar to that of the DNA of the target organism. 
Alternatively, the synthetic probe may eiftploy inosine as a 
substitute for those bases which are most likely to be 
divergent, or the probe, may be a mixed probe which mixes the 

15 codons for the source DNA with the preferred codons 

(encoding the same amino acid) for the target organism. 

• By routine methods, the Tm of a perfect duplex of 
starting DNA is determined. One may then select a 
hybridization temperature which is sufficiently lower than 

20 the perfect duplex Tm to allow hybridization of the starting 
DNA (or other probe) to a target DNA which is divergent from 
the starting DNA. A 1% sequence divergence typically lowers 
the Tm of a duplex by 1-2 "C, and the DNAs encoding . 
homologous proteins of different species typically have 

25 sequence identities of around 50-80%. Preferably, the 

library is screened under conditions where the temperature 
is at least 20"'C., more preferably at least 50°C., below the 
perfect duplex Tm. Since salt reduces the Tm, one 
ordinarily would carry out the search for DNAs encoding 

30 highly homologous proteins under relatively isw salt 

hybridization conditions, e.g., <1M NaCl. The higher the 
salt concentration, and/or the lower the temperature, the 
greater the sequence divergence which is tolerated. 

For the use of probes to identify homologous genes in 

35 other species, see, e.g., Schwinn, et al, , J. Biol. Chem. , 
2 65:8183-89 (1990) (hamster 67-bp cDNA probe vs. human 
leukocyte genomic library; human 0.32kb DNA probe vs. bovine 
brain cDNA library, both with hybridization at 42 °C in 
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6xSSC); Jenkins et al., J. Biol. Chem., 265:19624-31 (1990) 
(Chicken 770-bp cDNA probe vs. human genomic libraries; 
hybridization at 40''C in 50% formamide and 5xSSC) ; Murata et 
al., J. Exp. Med., 175:341-51 (1992) (1.2-kb mouse cDNA 
5 probe v. h\iman eosinophi cDNA library; hybridization at 65 

in 6xSSC); Guyer et al . , J. Biol. Chem-, 265:17307-17. (.1990) 
(2.95-kb human genomic DNA probe vs. porcine genomic DNA 
library; hybridization at 42^*0 in 5xSSC) . The conditions 
. set forth in these articles may each be considered suitable 
10 for the purpose of isolating homologous genes. 

Homologous Prot:eins and DMAs 

A human protein can be said to be identifiable as 
15 homologous to a mouse cDNA clone if 

(1) its sequence can be aligned to the mouse cDNA clone, 
using BlastX with the default parameters set forth below, 
and the expected value (E) of the alignment (the probability 

20 that such an alignment would have occurred by chance alone) 
is less than e-10, or 

(2) it is encoded by a human gene whose cDNA can be aligned 
to the mouse cDNA clone, using BlastN with the default 

25 parameters set forth below, and the E value of the alignment 
is less than e-10. 

Desirably, both conditions are satisfied. 

30 Preferably, for either or both conditions, the E value 

is less than e-15, more preferably less than e-20, still 
more preferably less than e-40, even more preferably less 
than e-60, considerably more preferably less than e-80, and 
most preferably less than e-100. 

35 

BlastN and BlastX report very low expected values as 
^^0.0". This does not truly mean that the expected value is 
exactly zero (since any alignment could occur by chance) , 
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but merely that it is so infinitesimal that it is not 
reported. The documentation does not state the cutoff 
value, but the data below includes alignments with explicit 
E values as low as e-178 (624 bits), while a score of 636 
5 bits was reported as ^^0.0". 

Functionally homologous human proteins are also of 
interest. A human protein may be' said to be functionally 
homologous to the mouse cDNA clone if (1) there is a mouse 

10 protein which is encoded by a mouse gene whose cDNA can be 
aligned to the mouse cDNA clone , using BlastX with the 
default parameters set forth below, and the E value of the 
alignment is less than e-50, and (2) the human protein has 
at least one biological activity in common with the mouse 

15 protein. 

The human proteins of interest also include those that 
are substantially and/or conservatively identical (as 
defined below) to the homologous and/or functionally 
20 homologous human proteins defined above. 



25 Relevance of Favorable and Unfavorable Genes 

If a gene is down-regulated in protected mammals / or 
up-regulated in damaged mammals, (i.e., an ^^unfavorable 
gene") then several utilities are apparent. 

First, the complementary strand of the gene, or a 
30 portion thereof, may be used in labeled form as a 

hybridization probe to detect messenger RNA and thereby 
monitor the level of expression of the gene in a subject. 
Elevated levels are indicative of damage, or possibly of a 
propensity to damage, and clinicians may take appropriate 
35 preventative, curative or ameliorative action. 

Secondly, the messenger RNA product (or equivalent 
cDNA) , the protein product, or a binding molecule specific 
for that product (e.g., an antibody which binds the 



wo 2004/013289 




PCT/IIS2003/024053 



product) , or a downstream product which mediates the 
activity (e.g.r a signaling intermediate) or a binding 
molecule (e.g., an antibody) therefor, may be used, 
preferably in labeled or immobilized form, as an assay 
5 reagent in an assay for said nucleic acid product, protein 
product, or downstream product (e.g., a signaling 
intermediate) . Again, elevated levels are indicative of a 
present or future problem. 

Thirdly,, an agent which down-regulates expression of 

10 the gene may be used to reduce levels of the corresponding 
protein and thereby inhibit further damage to the kidney-. 
This agent could inhibit transcription of the gene in the 
subject, or translation of the corresponding messenger RNA. 
Possible inhibitors of transcription and translation include 

15 antisense molecules and repressor molecules The agent 

could also inhibit a post-translational modification (e.g., 
glycosylation, phosphorylation, cleavage, GPI attachment) 
jcequired for activity, or post-translationally modify the 
protein so as to inactivate it. Or it could be an agent 

20 which down- or up-regulated a positive or negative 
regulatory gene, respectively. 

Fourthly, an agent which is an antagonist of the 
messenger RNA product or protein product of the gene, or of 
a downstream product through which its activity is 

25 manifested (e.g., a signaling intermediate), may be used to 
inhibit its activity. This antagonist could be an antibody. 

Fifthly, an agent which degrades, or abets the 
degradation of, that messenger RNA, its protein product or a 
downstream product which mediates its activity (e.g., a. 

30 signaling intermediate), may be used to curb the effective 
period of activity of the protein. 

If a gene is up-regulated in protected mammals, or 
down -reaulated in damaged animals then the utilities are 
converse to those stated above. 

35 First, the complementary strand of the gene, or a 

portion thereof, may be used in labeled form as a 
hybridization probe to detect messenger RNA and thereby 
monitor the level of expression of the gene in a subject. 
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Depressed levels are indicative of damage, or possibly of a 
propensity to damage, and clinicians may take appropriate 
preventative, curative or ameliorative action. 

Secondly, the messenger RNA product, the equivalent 
5 cDNA, protein product, or a binding molecule specific for 
those products, or a downstream product, or a signaling 
intermediate, or a binding molecule therefor, may be used, 
preferably in labeled or immobilized form, as an assay 
reagent in an assay for said protein product or downstream 

10 product. Again, depressed levels are indicative of a 
present or future problem. 

Thirdly, an agent which up-regulates expression of the 
gene may be used to increase levels of the corresponding 
protein and thereby inhibit further damage to the kidney. 

15 By way of example, it could be a vector which carries a copy 
of the gene, but which expresses the gene at higher levels 
than does the endogenous expression system. Or it could be 
an agent which up- or down-regulates a positive or negative 
regulatory gene. 

20 Fourthly, an agent which is an agonist of the protein 

product of the gene, or of a downstream product through 
which its activity (of inhibition of kidney damage) is 
manifested, or of a signaling intermediate may be used to 
foster its activity. 

25 Fifthly, an agent which inhibits the degradation of 

that protein product or of a downstream product or of a 
signaling intermediate may be used to increase the effective 
period of activity of the protein. 

30 Relevance of Antibody-Related Genes 

Some of the disclosed clones (e.g., A8, A34, A39, A48, 
E39, G38) appear to be related to antibody genes. It is 
widely accepted that diabetes is a microvasculature problem. 
Elevated levels of glucose or lack of insulin or insulin 
35 resistance result in pathological problems with capillaries. 
The lack of proper blood supjply to a variety of tissues 
results in tissue damage including neuronal problems 
(neurology) , kidney problems (nephropathy) , cardiovascular 
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problems^ and eye problems (proliferative diabetic 
retinopathy) • Severe blood vessel problems (macrovascular 
disorders) can lead to necrotic tissue that often results in 
amputations • 

Also, there is considerable evidence that the 
underlying etiology of Type I diabetes is a problem with the 
immune system, in that an autoimmune reaction is mounted 
that results in destruction of the Beta (insulin producing) 
cells of the pancreas. 

Immune reactions have been shown to be involved in 
glomerulosclerosis, see Chen, et al.. Nephron, 78: 63-72 
(1998) and Linder, et al., Clin. Immunol. Immunopathol . r 
1:104-21 (1972), and to occur in transgenic mice expressing 
bovine GH and exhibiting arthritis, see Ogueta, supra. 

Thus, it is entirely possible .that in a type IT 
diabetic (and in other tissues of Type I diabetics), 
autoimmune reactions take place. The scenario could be as 
follows: due to inappropriate blood flow to a given tissue 
(for example, the kidney), pathological changes occur. 
These changes could be in the form of glomerulsclerosis, 
necrosis, or a variety of other problems. Proteins found in 
the tissue may become altered during these pathological 
changes (for example inappropriate glycosylation, 
phosphorylation, protein cleavage) and this ultimately 
results in the exposure of ^"new'' or foreign epitopes to the 
host. Thus, an autoimmune reaction could be mounted against 
these proteins that impact the destruction of the tissue. 

It follows that inhibition of these autoimmune 
antibodies can be beneficial. 

Conceivably, as titers of these autoimmune antibodies 
rise, a new immune response, featuring anti-idiotypic 
antibodies, will occur; such antibodies would be 
^^f avorable" . 
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Genes /Proteins of Interest: 

Growth hormone (GH) is thought to play a role in the 
progression of kidney damage in mice. Bovine (b) GH 
transgenic mice possess elevated levels of insulin-like 
growth factor -1 (IGF-1) , elevated levels of insulin, a 
giant phenotype, and develop severe kidney damage resulting 
in death within 7-12 months. GH antagonist mice have 
decreased levels of IGF-1, a dwarf phenotype, and fail to 
develop kidney damage, even when made diabetic by treatment 
with streptozotocin (Stz) . Therefore, the bGH mice that 
develop kidney damage independent of diabetes but with 
characteristics similar to Stz-induced diabetic nephropathy 
may provide a model system for determining GH' s role in the 
progression of kidney damage. As a consequence of 
determining the genes activated or inactivated in 
correlation with* the morphology of the kidney, as it 
progresses towards glomerulosclerosis, agents may be 
identified which can protect kidneys from glomerulosclerosis 
and other kidney damage. 

Favorable genes /proteins include the following: 
- SON protein 

-vacuolar adenosine triphosphatase 
-Ac39/physophilin 

-ubiquitin protein ligase Nedd-4 
-kindney androgen regulated protein 
-claudins 

-heat shock proteins 

-far upstream element (FUSE) binding protein 

Unfavorable genes /proteins include the following: 
-3 beta-hydroxysteroid dehydrogenase, 
especially of type IV or V, 
-disabled-2 {dab-2) p96 
-palmitylated serine /threonine kinase 
-tumor differentially expressed (TDE) 1 
-cytochrome oxidase III 
-ERG2 protein 
-TLH29 protein precursor 
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-glutathoione peroxidase 3 



Muteuit Proteins 

The present invention also contemplates mutant proteins 
(peptides) which are substantially identical (as defined 
below) to the parental protein (peptide). In general, the 
fewer the mutations, the more likely the mutant protein is 
to .retain the activity of the parental protein. The effect • 
of mutations is usually (but not always) additive. Certain 
individual mutations are more likely to be tolerated than 
others . 

A protein is more likely to tolerate a mutation which 

(a) is a substitution rather than an insertion or 
deletion; 

(b) is an insertion or deletion at the terminus, 
rather than internally, or, if internal, is at a 
domain boundary, or a loop or turn, rather than in 
an alpha helix or beta strand; 

(c) affects a surface residue rather than an 
interior residue; 

(d) affects a part of the molecule distal to the 
binding site;. 

(e) is a substitution of one amino acid for 
another of similar size, charge, and/or 
hydrophobicity, and does not destroy a disulfide 
bond or other crosslink; and 

(f) is at a site which is siibject to substantial 
variation among a family of homologous proteins to 
which the protein of interest belongs. 

These considerations can be used to design functional 
mutants . 

Surface vs. Interior Residues 

Charged residues almost always lie on the surface of 
the protein. For uncharged residues, there is less 
certainty, but in general, hydrophilic residues are 
partitioned to the surface and hydrophobic residues to the 
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interior. Of course, for a membrane protein, the membrane- 
spanning segments are likely to be rich in hydrophobic 
residues. 

Surface residues may be identified experimentally by 
various labeling techniques, or by 3-D structure mapping 
techniques like X-ray diffraction and NMR. A 3-D model- of a 
homologous protein can he helpful. 

Binding Site Residues 

Residues forming the binding site may be identified by 
(1) comparing the effects of labeling the surface residues 
before and after complexing the protein to its target, (2) 
labeling the binding site directly, with affinity ligands, 
(3) fragmenting the protein and testing the fragments for 
binding activity, and (4) systematic mutagenesis (e.g., 
alanine-s canning mutagenesis) to determine which mutants 
destroy binding. If the binding site of a homologous 
protein is known, the binding site may be postulated by 
analogy . 

Protein libraries" may be constructed and screened that 
a large family (e.g., 10") of related mutants may be 
evaluated simultaneously. 

Hence, the mutations are preferably conservative 
modifications as defined below.. 

"Substantially Identical" 

A mutant protein (peptide) is substantially identical 
to a reference protein (peptide) if (a) it has at least 10% 
of a specific binding activity or a non-nutritional 
biological activity of the reference protein, and (b) is at 
least 50% identical in amino acid sequence to the reference 
protein (peptide) . It is "substantially structurally 
identical" if condition (b) applies, regardless of (a) . 

Percentage amino acid identity is determined by 
aligning the mutant and reference sequences according to a 
rigorous dynamic programming algorithm which globally aligns 
their sequences to maximize their similarity, the similarity 
being scored as the sum of scores for each aligned pair 
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according to an unbiased PAM250 matrix, and a penalty for 
each internal gap of -12 for the first null of the gap and - 
4 for each additional null of the same gap. The percentage 
identity is the nvimber of matches expressed as a percentage 
of the adjusted (i.e., counting inserted nulls) length of 
the reference sequence. 

A- mutant DNA sequence is substantially identical to a 
reference DNA sequence if they are structural sequences / and 
encoding mutant and reference proteins which are 
substantially identical as described above. 

If instead they are regulatory sequences, they are 
substantially identical if the mutant sequence has at least 
10% of the regulatory activity of the reference sequence, 
and is at least 50% identical in nucleotide sequence to the 
reference sequence. Percentage identity is determined as 
for proteins except that matches are scored +5, mismatches - 
4, the gap open penalty is -12, and the gap extension 
penalty (per additional null) is -4. 

Preferably, sequence which are substantially identical 
exceed the minimum identity of 50% e.g., are 51%, 66%, 75%, 
80%, 85%, 90%, 95% or 99% identical in sequence. 

DNA sequences may also be considered "substantially 
identical" if they hybridize to each other under stringent 
conditions, i.e., conditions at which the Tm of the 
heteroduplex of the one strand of the mutant DNA and the 
more complementary strand of the reference DNA is not in 
excess of 10 "C. less than the .Tm of the reference DNA 
homoduplex. Typically this will correspond to a percentage 
identity of 85-90%. 

"Conservative Modifications" 

"Conservative modifications" are defined as 

(a) conservative substitutions of amino acids as 
hereafter defined; or 

(b) single or multiple insertions (extension) or 
deletions (truncation) of amino acids at the 
termini . 

Conservative modifications are preferred to other 
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modifications. Conservative substitutions are preferred to 
other conservative modifications. 

"Semi-Conservative Modifications" are modifications 
which are not conservative, but which are (a) semi- 
conservative substitutions as hereafter defined; or (b) 
single or multiple insertions or deletions internally/ but 
at interdomain boundaries, in loops or in other segments of 
relatively high mobility. Semi-conservative modifications 
are preferred to nonconservative modifications. Semi- 
conservative substitutions are preferred to other semi- 
conservative modifications. 

Non-conservative substitutions are preferred to other 
non- conservative modifications . 

The term "conservative" is used here in an ^ prioyj 
sense, i.e., modifications which would be expected to 
preserve 3D structure and activity, based on analysis of the 
naturally occurring families of homologous proteins and of 
past experience with the effects of deliberate mutagenesis, 
rather than post facto , a modification already known to 
conserve activity. Of course, a modification which is 
conservative a nriori may, and usually is, also conservative 
post facto.- 

Preferably, except at the termini, no more than about 
five amino acids are inserted or deleted at a particular 
locus, and the modifications are outside regions known to 
contain binding sites important to activity. 

Preferably, insertions or deletions are liinited to the 
termini . 

A conservative substitution is a substitution of one 
amino acid for another of the same exchange group, the 
exchange groups being defined as follows 

I Gly, Pro, Ser, Ala (Cys) (and any nonbiogenic, 
neutral amino acid with a hydrophobicity not 
exceeding that of the aforementioned a.a.'s) 

II Arg, Lys, His (and any nonbiogenic, positively- 
charged amino acids) 

III Asp, Glu, Asn, Gin (and any nonbiogenic 
negatively-charged amino acids) 
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IV Leu, lie. Met, Val (Cys) (and any nonbiogenic, 
aliphatic, neutral amino acid with a 
hydrophobicity too high for I above) 

V Phe, Trp, Tyr (and any nonbiogenic, aromatic 
neutral amino acid with a hydrophobicity too high 
for I above) . 

Note that Cys belongs to both I and IV. 
Residues Pro, Gly and Cys have special conformational 
roles. Cys participates in formation of disulfide bonds. 
Gly imparts flexibility to the chain, P.ro -imparts rigidity 
to the chain and disrupts a helices. These residues may be 
essential in certain regions of the polypeptide, but 
substitutable elsewhere. 

One, two or three conservative substitutions are more 
likely to be tolerated than a larger number. 

"Semi-conservative substitutions" are defined herein as 
being substitutions within supergroup I/II/IH or within 
supergroup IV/V, but not within a single one of groups I-V. 
They also include replacement of any other amino acid with 
alanine. If a substitution is not conservative, it 
preferably is semi-conservative. 

•••Non-conservative substitutions" are substitutions 
which are not ^^conservative" or "semi-conservative" . 

"Highly conservative substitutions" are- a subset of 
conservative substitutions, and are exchanges of amino acids 
within the groups Phe/Tyr/Trp, Met/Leu/Ile/Val, His/Arg/Lys, 
Asp/Glu and Ser/Thr/Ala. They are more likely to be 
tolerated than other conservative substitutions. Again, the 
smaller the number of substitutions, the more likely they 
are to be tolerated- 

^^Conservatively Identical" 

A protein (peptide) is conservatively identical to a 
reference protein (peptide) it differs from the latter, if 
at all, solely by conservative modifications, the protein 
(peptide remaining at least seven amino acids long if the 
reference protein (peptide) was at least seven amino acids 
long. 



wo 2004/013289 




PCTAJS2003/0240S3 



A protein is at least semi-conservatively identical to 
a reference protein (peptide) if it differs from the latter, 
if at all, solely by setni- conservative or conservative 
modifications . 

5 . A protein (peptide) is nearly conservatively identical 

to a reference protein (peptide) if it differs from the 
latter, if at all, solely by one or more conservative 
.modifications and/or a single nonconservative substitution. 

It is highly conservatively identical if it differs, if 
10 at ^11, solely by highly conservative substitutions. Highly 
conservatively identical proteins are preferred to those 
merely conservatively identical. An absolutely identical 
protein is even more preferred. 

15 

The core sequence of a reference protein (peptide) is- 
the largest single fragment which retains at least 10% of a 
particular specific binding activity, if one is specified, 
or otherwise of at least one specific binding activity of 

20 the referent. If the referent has more than one specific 
binding activity, it may have more than one core sequence, 
and these may overlap or not. 

If it is taught that a peptide of the present invention 
may have a particular similarity relationship (e.g., 

25 markedly identical) to a reference protein (peptide), 
preferred peptides are those which comprise a sequence 
having that relationship to a core sequence of the reference 
protein (peptide), but with internal insertions or deletions 
in either sequence excluded. Even more preferred peptides 

30 are those whose entire sequence has that relationship, with 
the same exclusion, to a core sequence of that reference 
protein (peptide) . 



35 Library 

The term "library" generally refers to a collection of 
chemical or biological entities which are related in origin, 
structure, and/or function, and which can be screened 
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simultaneously for a property of interest. 

Libraries may be classified by how they are constructed 
(natural vs. artificial diversity; combinatorial vs. 
noncombinatorial) , how they are screened (hybridization, 
5 expression, display) , or by the nature of the screened 
library members (peptides, nucleic acids, etc.). 

In a ^^natural diversity" library, essentially all of 
the diversity arose without .human intervention. This would 
be true, for example, of messenger RNA extracted from a non- 
10 engineered cell. 

In a ^'synthetic diversity" library, essentially all of 
the diversity arose deliberately as a result of human 
intervention.- This would be true for example of a 
combinatorial library; note that a small level of natural 
15 diversity could still arise as a result of spontaneous 
mutation. It would also be true of a noncombinatorial 
library of compounds collected from diverse sources, even if 
they were all natural products. 

In a ^"non-natural diversity" library, at least some of 
20 the diversity arose deliberately through human intervention. 

In a ""controlled origin" library, the source of the 
diversity is limited in some way. A limitation might be to 
cells of a particular individual, to a particular species, 
or to a particular genus, or, more complexly, to individuals 
25 of a particular species who are of a particular age, sex, 

physical condition, geographical location, occupation and/or 
familial relationship. Alternatively or additionally, it 
might be to cells of a particular tissue or organ. Or it 
could be cells exposed to particular .pharmacological, 
30 environmental, or pathogenic conditions. Or the library 

could be of chemicals, or a particular class of chemicals, 
produced by such cells. 

In a ""controlled structure" library, the library 
members are deliberately limited by the production 
35 conditions to particular chemical structures. For example, 
if they are oligomers, they may be limited in length and 
monomer composition, e.g. hexapeptides composed of the 
twenty genetically encoded amino acids . 
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Hvbridlz afci-on Library . 

In a hybridization library, the library members are 
nucleic acids, and are screened using a nucleic acid 
hybridization probe. Bound nucleic acids may then be 
5 sequenced. 

Expression Library 

In an expression library, the screened library members 
are gene expression products, but one may also speak of an 

10 underlying library of genes encoding those products. The 
library is made by subcloning DNA encoding the library 
members (or portions thereof) into expression vectors (or 
into cloning vectors which- subsequently are used to 
construct expression vectors) , each vector comprising an 

15 expressible gene encoding a particular library member, 

introducing the expression vectors into suitable • cells, and 
expressing the genes so the expression products are 
produced. 

In one embodiment, the expression products are 

20 secreted, so the library can be screened using an affinity 
reagent, such as an antibody or receptor. The bound 
expression products may be sequenced directly, or their 
sequences inferred by, e.g., sequencing at least the 
variable portion of the encoding DNA. 

25 In a second embodiment, the cells are lysed, thereby 

exposing the expression products, and the latter are 
screened with the affinity reagent. 

In a third embodiment, the cells express the library 
members in such a manner that they are displayed on the 

30 surface of the cells, or on the surface of viral particles 
produced by the cells. (See display libraries, below). 

In a fourth embodiment, the screening is not for the 
ability of the expression product to bind to an affinity 
reagent, but rather for its ability to alter the phenotype 

35 of the host cell in a particular detectable manner. Here, 
the screened library members are transformed cells, but 
there is a first underlying library of expression products 
which mediate the behavior of the cells, and a second 
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underlying library of genes which encode those products. 

pjsplav Library 

In a display library,, the library members are each 
conjugated to, and displayed upon, a support of some kind. 
The support may be living (a cell or virus), or nonliving 
(e.g.-, a bead or plate) . 

If the support is a cell or virus, display will 
normally be effectuated by expressing a fusion protein which 
comprises .the library member, a carrier moiety allowing 
integration of the fusion protein. into the surface of the 
cell or virus, and optionally a lining moiety. In a 
variation on this theme, the cell coexpresses a first fusion 
comprising the library member and. a linking moiety LI, and a 
second fusion comprising a linJcing moiety L2 and the carrier 
moiety. LI and L2 interact to associate .the first fusion 
with the second fusion and hence, indirectly, the library 
member with the surface of the cell or virus. 

sninble Library 

In a soluble library, the library members are free in 
solution. A soluble library may be produced directly, or 
one may first make a display library and then release the 
library members from their supports. 

pncapsulated Library 

In an encapsulated library, the library members are 
inside cells or liposomes. Generally speaking, encapsulated 
libraries are used to store the library members for future 
use; the members are extracted in some way for screening 
purposes. However, if they differentially affect the 
phenotype of the cells, they may be screened indirectly by 
screening the cells. 

cDNA Library 

A cDNA library is usually prepared by extracting RNA 
from cells of particular origin, fractionating the RNA to 
isolate the messenger RNA (mRNA has a poly (A) tail, so this 
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is usually done by oligo-dT affinity chromatography) , 
synthesizing complementary DNA (cDNA) using reverse 
transcriptase r DNA polymerase, and other enzymes, subcloning 
the cDNA into vectors, and introducing the vectors into 
cells. Often, only mRNAs or cDNAs of particular sizes will 
be. used, to make it more likely that the cDNA encodes a- 
functional polypeptide. 

A cDNA library explores the natural diversity of the 
transcribed DNAs of cells from a particular source. It is 
not a combinatorial library. 

A cDNA library may be used to make a hybridization 
library, or it may be used as an (or to make) expression, 
library. 

Genomic DNA Library 

A genomic DNA library is made by extracting DNA from a 
particular source/ fragmenting the DNA, isolating fragments 
of a particular size range, subcloning the DNA fragments 
into vectors, and introducing the vectors into cells. 

Like a cDNA library,, a genomic DNA library is a natural 
diversity library, and not a combinatorial library. A 
genomic DNA library may .be used the same way as a cDNA 
library. 

Synthetic DNA library 

A synthetic DNA library may be screened directly (as a 
hybridization library) , or used in the creation of an 
expression or display library of peptides/proteins . 

Combinat orial Libraries 

The term "combinatorial library" refers to a library in 
which the individual members are either systematic or random 
combinations of a limited set of basic elements, the 
properties of each member being dependent on the choice and 
location of the elements incorporated into it. Typically, 
the members of the library are at least capable of being 
screened simultaneously. Randomization may be complete or 
partial; some positions may be randomized and others 
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predetermined, and at random positions, the choices may be 
limited in a predeterinined manner. The members of a 
•combinatorial library may be oligomers or polymers of some 
kind, in which the variation, occurs through the choice of 

5 monomeric building block at one or more positions of the 

oligomer or polymer, and possibly in terms of the connecting 
linkage, or the length of the oligomer or polymer, too. Or 
the members may be nonoligomeric molecules with a standard 
core structure, like the 1, 4-benzodiazepine structure, with 

10 the variation being introduced by the choice of substituents 
at particular variable sites on the core structure. Or the 
members may be nonoligomeric molecules assembled- like a 
jigsaw puzzle, but wherein each piece has both one or more 
variable moieties (contributing to library diversity) and 

15 one or more constant moieties (providing the functionalities 
for coupling the piece, in question to other pieces) . 

Thus, in a typical combinatorial library, chemical 
building blocks are at least partially randomly combined 
into a large nxunber (as high as 10E15) of different 

20 compounds, which are then simultaneously screened for 

binding (or other) activity against one or more targets. 

In a "simple combinatorial library", all of the members 
belong to the same class of compounds (e.g., peptides) and 
can be synthesized simultaneously. A "composite 

25 combinatorial library" is a mixture of two or more simple 
libraries, e.g., DNAs and peptides, or peptides, peptoids, 
and PNAs, or benzodiazepines and carbamates. The number of 
component simple libraries in a composite library will, of 
course, normally be smaller than the average number of 

30 members in each simple library, as otherwise the advantage 
of a library over individual synthesis is small. 

Libraries of thousands, even millions, of random 
oligopeptides have been prepared by chemical synthesis 
(Houghten et al.. Nature, 354:84-6(1991)), or gene 

35 expression (Marks et al., J Mol Biol, 222:581-97(1991)), 

displayed on chromatographic supports (Lam et al.. Nature, 
354:82-4(1991)), inside bacterial cells (Colas et al.. 
Nature, 380:548-550(1996)), on bacterial pili (Lu, 
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Bio/Technology, 13:366-372(1990)), or phage (Smith, Science, 
228:1315-7(1985)), and screened for binding to a variety of 
targets including antibodies (Valadon et al., J Mol Biol, 
261:11-22(1996)), cellular proteins (Schmitz et al., J Mol 
Biol, 260:664-677(1996)), viral proteins (Hong and 
Boulanger, Embo J, 14:4714-4727(1995)), bacterial proteins 
(Jacobsson and Frykberg, • Biotechniques, 18 : 878-885 (1995) ) , 
nucleic acids (Cheng et al., Gene, 171:1-8(1996)), and 
plastic (Siani et al., J Chem Inf Comput Sci, 34:588- 
593 (1994) ) . • • ' 

Libraries of proteins (Ladner, USP 4 , 664, 989) , • peptoids 
(Simon et al., Proc Natl Acad Sci USA, 89:9367-71(1992)), 
nucleic acids (Ellington and Szostak, Nature, 
246:818(1990)), carbohydrates, and small organic molecules 
(Eichler et al-, Med Res Rev, 15:481-96(1995)) have also 
been prepared or suggested for drug screening purposes - 

The first combinatorial libraries were composed of 
peptides or proteins, in which all or selected amino acid 
positions were randomized. • Peptides and proteins can exhibit 
high and specific binding activity, and can act as 
catalysts. In consequence, they are of great importance in 
biological systems . 

Nucleic acids have also been used in combinatorial 
libraries. Their great advantage is the ease with which a 
nucleic acid with appropriate binding activity can be 
amplified. As a result, combinatorial libraries composed of 
nucleic acids can be of low redundancy and hence, of high 
diversity . 

There has also been much interest in combinatorial 
libraries based on small molecules, which are more suited to 
pharmaceutical use, especially those which, like 
benzodiazepines, belong to a chemical class which has 
already yielded useful pharmacological agents. The 
techniques of combinatorial chemistry have been recognized 
as the most efficient means for finding small molecules that 
act on these targets. At present, small molecule 
combinatorial chemistry involves the synthesis of either 
pooled or discrete molecules that present varying arrays of 
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functionality on a common scaffold. These compounds are 
grouped in libraries that are then screened against the 
target of interest either for binding or for inhibition of 
biological activity. 
5 The size of a library is the number of molecules in it. 

The simple diversity of a library is the number of unique 
structures in it. There is no formal minimxim or maximum 
diversity. If the library has a very low diversity ^ the 
library has little advantage over just synthesizing and 

10 screening the members individually. If the library is of 
very high diversity, it may be inconvenient to handle, at 
least without automatizing the process. The simple 
diversity of a library is preferably at least 10, 10E2, 
10E3, 10E4, 10E6, 10E7, 10E8 or 10E9, ' the' higher the better 

15 under most circumstances • The simple diversity is usually 
not more than 10E15, and more usually not more- than lOElO. 

The average sampling level is the size divided by the 
simple diversity. The expected average sampling level must 
be high enough to provide a reasonable assurance that, if a. 

20 given structure were expected, as a consequence of the 
library design, to be present, that the actual average 
sampling level will be high enough so that the structure, if 
satisfying the screening criteria, will yield a positive 
result when the library is screened. Thus, the preferred 

25 average sampling level is a function of the detection Limit, 
which in turn is a function of the strength of the signal to 
be . screened - 

There are more complex measures of diversity than 
simple diversity- These attempt to take into account the 
30 degree of structural difference between the various unique 
sequences. These more complex measures are usually used in 
the context of small organic compound libraries, see below. 

The library members may be presented as solutes in 
solution, or immobilized on some foirm of support. In the 
35 latter case, the support may be living (cell, virus) or 

nonliving (bead, plate, etc.). The- supports may be separable 
(cells, virus particles, beads) so that binding and 
nonbinding members can be separated, or nonseparable 
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(plate) . In the latter case, the members will normally be 
placed on addressable positions on the support. The 
advantage- of a soluble library is that there is no carrier 
moiety that could interfere with the binding of the members 
to the support. The advantage of an immobilized library is 
that it is easier to identify the structure of the members 
which were positive. 

When screening a soluble libraryr or one with a 
separable support, the target is usually immobilized. When 
screening a library cJn a nonseparable support, the target 
will usually be labeled. 

Oligonucleotide Libraries . 

An oligonucleotide library is .a- combinatorial library, 
at least some of whose members are single-stranded 
oligonucleotides having three or more nucleotides connected 
by phosphodiester or analogous bonds. The oligonucleotides 
may be linear, cyclic or branched, and may include non- 
nucleic acid moieties . The nucleotides are not limited to 
the nucleotides normally found in DNA or RNA.- For examples 
of nucleotides modified to increase nuclease resistance and 
chemical stability of aptamers, see Chart 1 in Osborne and 
Ellington, Chem. Rev., 97: 349-70 (1997). For screening of 
RNA, see Ellington and Szostak, Nature, 346: 818-22 (1990) . 

There is no formal minimum or maximum size for these 
oligonucleotides. However, the number of conformations which 
an oligonucleotide can assume increases exponentially with 
its length in bases. Hence, a longer oligonucleotide is 
more likely to be able to fold to adapt itself to a protein 
surface. On the other hand, while very long molecules can 
be synthesized and screened, unless they provide a much 
superior affinity to that of shorter molecules, they are not 
likely to be found in the selected population, for the 
reasons explained by Osborne and Ellington (1997) . Hence, 
the libraries of the present invention are preferably 
composed of oligonucleotides having a length of 3 to 100 
bases, more preferably 15 to 35 bases. The oligonucleotides 
in a given library may be of the same or of different 
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lengths . 

Oligonucleotide libraries have the advantage that 
libraries of very high diversity (e.g., 10") are feasible, 
and binding molecules are readily amplified in vitro by 
5 polymerase chain reaction (PGR) . Moreover, nucleic acid 
molecules can have very high specificity and affinity to 
targets . 

In a preferred embodiment, this invention • prepares and 
screens oligonucleotide libraries. by the SELEX method, as 
.10 described in King and Famulok, Molec. Biol. Repts., 20: 97- 
107 (1994); L. Gold, C. Tuerk. Methods of producing nucleic 
acid ligands, US#5595877; Oliphant et al. Gene 44:177 
(1986) . 

The term "aptamer" is conferred on those 

15 oligonucleotides which bind the target protein. Such 

aptamers may be ^used to characterize the target protein, 
both directly (through identification of the aptamer and the 
points of contact between the aptamer and the protein) and 
indirectly (by use of the aptamer as a ligand to modify the 

20 chemical reactivity of the protein) . 

In a classic oligonuclotide, each nucleotide (monomeric 
unit) is composed of a phosphate group, a sugar moiety, and 
either a purine or a pyrimidine base. In DNA, the sugar is 
deoxyribose and in RNA it is ribose. The nucleotides are 

25 linked by 5 '-3' phosphodiester bonds. 

The deoxyribose phosphate backbone of DNA can be 
modified to increase resistance to nuclease and to increase 
penetration of cell membranes. Derivatives such as mono- or 
dithiophosphates, methyl phosphonates, boranophosphates, 

30 formacetals, carbamates, siloxanes, and dimethylenethio- 
sulfoxide©- and-sulf ono- linked species are known in the 
art . 



Peptide Library 

35 A peptide is composed of a plurality of amino acid 

residues joined together by peptidyl (-NHCO-) bonds. A 
biogenic peptide is a peptide in which the residues are all 
genetically encoded amino acid residues; it is not necessary 



wo 2004/013289 




PCT/US2003/024053 



that the biogenic peptide actually be produced by gene 
expression, 

Amino acids are the basic building blocks with which 
peptides and proteins are constructed. Amino acids possess 
both an amino group (-NHg) and a carboxylic acid group (- 
COOH) . Many amino acids, but not all, have the alpha amino 
acid structure NH2-CHR-COOH, where R is hydrogen, or any of a 
variety of functional groups • * * 

• Twenty amino acids are genetically encoded: Alanine, 
Arginine, Asparagine, Aspartic Acid, Cysteine, Glutamic 
Acid, Glutamine, Glycine, Histidine, Isoleucine, Leucine, 
Lysine, Methionine, Phenylalanine, * Proline, Serine, 
Threonine, Tryptophan, Tyrosine, and Valine. Of these, all 
save Glycine are optically isomeric, however, only, the L- 
form is found in humans. Nevertheless, the D-forms of these 
amino acids do have biological significance; D-Phe, for 
example, is a known analgesic. 

.Many other amino acids are also known, including: 2- 
Aminoadipic acid; 3-Aminoadipic acid; beta-Aminopropionic 
acid; 2-Aminobutyric acid; 4-Aminobutyric acid (Piperidinic 
acid) ; 6-Aminocaproic acid; 2-Aminoheptanoic acid; 2- 
Aminoisobutyric acid, 3-Aminoisobutyric acid; 2'-Aminopimelic 
acid; 2, 4-Diaminobutyric acid; Desmosine; 2,2'- 
Diaminopimelic acid; 2 , 3-Diaminopropionic acid; N- 
Ethylglycine; N-Ethylasparaginer Hydroxylysine; allo- 
Hydroxylysine ; 3-Hydr oxyproline ; 4 -Hydroxyproline ; 
Isodesmosine; allo-Isoleucine; N-Methylglycine (Sarcosine) ; 
N-Methylisoleucine; N-Methyl valine; Norvaline; Norleucine; 
and Ornithine. 

Peptides are constructed by condensation of amino acids 
and/or smaller peptides. The aicd.no group of one amino acid 
(or peptide) reacts with the carboxylic acid group of a 
second amino acid (or peptide) to form a peptide (-NHCO-) 
bond, releasing one molecule of water. Therefore, when an 
amino acid is incorporated into a peptide, it should, 
technically speaking, be referred to as an amino acid 
residue . The core of that residue is the moiety which 
excludes the -NH and -CO linking functionalities which 



wo 2004/013289 ^ 



40 



PCT/US2003/024053 



connect it to other residues. This moiety consists of one 
or more main chain atoms (see below) and the attached side 
chains . 

The main chain moiety of each amino acid consists of 
the -NH and -CO linking functionalities and a core main 
•chain moiety. Usually the latter is a single carbon atom. 
However, the core main chain moiety may include additional 
carbon atoms, and may also include nitrogen, oxygen or 
•sulfur atoms, which together form. a single chain. In a 
preferred embodiment, the core main. chain atoms consist 
solely of carbon atoms. 

The side chains are attached to the core main chain 
atoms. For alpha amino acids, ±n which the side chain is 
attached to the -alpha carbon, the C-1, C-2 and N-2 of each 
residue form the repeating unit of the main chain, and the 
word ^^side chain" refers to the C-3 and higher numbered 
carbon atoms and their substituents . It also includes H 
atoms attached to the main chain atoms. 

Amino acids may be classified according to the number 
of carbon atoms which appear in the main chain between the 
carbonyl carbon and amino nitrogen atoms which participate 
in the peptide bonds. Among the 150 or so amino acids which 
occur in nature, alpha, beta, gamma and delta amino acids 
are known. These have 1-4 intermediary carbons. Only alpha 
amino acids occur in proteins. Proline is a special case of 
an alpha amino acid; its side chain also binds to the 
peptide bond nitrogen. : 

For beta and higher order amino acids, there is a 
choice as to which main chain core carbon a side chain other 
than H is attached to. The preferred attachment site is the 
C-2 (alpha) carbon, i.e., the one adjacent to the carboxyl 
carbon of the -CO linking functionality. It is also possible 
for more .than one main chain atom to carry a side chain 
other than H. However, in a preferrred embodiment, only one 
main chain core atom carries a side chain other than H. 

A main chain carbon atom may carry either one or two 
side chains; one is more common. A side chain may be 
attached to a main chain carbon atom by a single or a double 



wo 2004/013289 




PCTAJS2003/024053 



41 



bond; the former is more common. 

A simple combinatorial peptide library is one whose 
members are peptides having three or more amino acids 
connected vid peptide bonds . 

The peptides may be linear, branched, or cyclic, and 
may covalently or noncovalently include nonpeptidyl 
moieties. The amino acids are not limited to the naturally 
occurring or to the genetically encoded amino acids- 

A biased peptide library is one in which one or more 
(but not all)- residues- of the peptides are constant 
residues. 

Cyclic Peptides 

Many naturally occurring peptides are cyclic. 
Cyclization is a common mechanism for stabilization of 
peptide .conformation thereby achieving improved association 
of the peptide with its ligand and hence improved biological 
activity. Cyclization is usually achieved by intra-chain 
cystine formation, by formation of peptide bond between side 
chains or between N- and terminals. Cyclization was 
usually achieved by peptides in solution, but several 
publications have appeared that describe cyclization of 
peptides on beads . 

A peptide library may be an oligopeptide library or a 
protein library. 

Oligopeptides 

Preferably, the oligopeptides are at least five, six, 
seven or eight amino acids in length. Preferably, .they are 
composed of less than 50, more preferably less than 20 amino 
acids . 

In the case of an oligopeptide library, all or just 
some of the residues may be variable. The oligopeptide may 
be unconstrained, or constrained to a particular 
conformation by, e.g., the participation of constant 
cysteine residues in the formation of a constraining 
disulfide bond. 
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Proteins 

Proteins, like oligopeptides, are composed of a 
plurality of amino acids, but the term protein is usually 
reserved for longer peptides, which are able to fold into a . 
5 stable conformation. A protein may be composed of two or 
more polypeptide chains, held together by covalent or 
noncovalenl: crosslinks. 

A peptide is considered a protein if it (1) is at least 
50. amino acids long, or (2) has at least two stabilizing 
10 covalent crosslinks (e.g., disulfide bonds). Thus, 
conotoxins are considered proteins. 

Usually, the proteins of a protein library will be 
characterizable as having both constant residues (the same 
for all proteins in the library) and Variable residues 
15 (which vary from member to member) . This is simply because, 
for a given range of variation at each position, the 
sequence space (simple diversity) grows exponentially with 
the niimber of residue positions, so at some point it becomes 
inconvenient for all residues of a peptide to be variable 
20 positions. Since proteins are usually larger than 

oligopeptides, it is more common for protein libraries than 
oligopeptide libraries to feature variable positions. 

In the case of a protein library, it is desirable to 
focus the mutations at those sites which are tolerant of 
25 mutation. These may be determined by alanine scanning 

mutagenesis or by comparison of the protein sequence to that 
of homologous proteins of similar activity. It is also more 
likely that mutation of surface residues will directly 
affect binding. Surface residues may be determined by 
30 inspecting a 3D structure of the protein, or by labeling the 
surface and then ascertaining which residues have received 
labels. They may also be inferred by identifying regions of 
high hydrophilicity within the protein. 

Because proteins are often altered at some sites but 
35 not others, protein libraries can be considered a special 
case of the biased peptide library. 

There are several reasons that one might screen a 
protein library instead of an oligopeptide library. 
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including (1) a particular protein, mutated in the library, 
has the desired activity to some degree already, and (2) the 
oligopeptides are not expected to have a sufficiently high 
affinity or specificity since they do not have a stable 
5 conformation. 

When the protein library is based on a parental protein 
which does not have the desired activity, the parental 
protein will usually' be one which is of high stability 
(melting point >= 50 deg. C.) and/or possessed of 

10 hypervariable regions. 

The variable domains of an antibody possess 
hypervariable regions and hence, in some embodiments, the 
protein library comprises members which comprise- a mutant of 
VH or VL chain, or a mutant of an antigen-specific binding 

15 fragment of such a chain. VH and VL chains are usually each 
about 110 amino acid residues, and are held in proximity by 
a disulfide bond between the adjoing CL and CHI regions to 
form a variable domain. Together^ the VH, VL, CL and CHI 
form an Fab fragment. 

20 In human heavy chains, the hypervariable regions are at 

31-35, 49-65, 98-111 and 84-88, but only the first three are 
involved in antigen binding. There is variation among VH 
and VL chains at residues outside the hypervariable regions, 
but to a much lesser degree. 

25 A sequence is considered a mutant of a VH or VL chain 

if it is at least 80% identical to a naturally occurring VH 
or VL chain at all residues outside the hypervariable 
region. 

In a preferred embodiment, such antibody library 
30 members comprise both at least one VH chain and at least one 
VL chain, at least one of which is- a mutant chain, and which 
chains may be derived from the same or different antibodies. 
The VH and VL chains may be covalently joined by a suitable 
linker moiety, as in a ^^single chain antibody", or they may 
35 be noncovalently joined, as in a naturally occurring 
variable domain . 

If the joining is noncovalent, and the library is • 
displayed on cells or virus, then either the VH or the VL 
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chain may be fused to the carrier surface/coat protein. The 
complementary chain may be co-expressed, or added 
exogenously to the library. 

The members may further comprise some or all of an 
5 antibody constant- heavy and/or constant light chain, or a 
mutant thereof. 

Peptoid Library 

A peptoid is an analogue of a peptide in which one or • 
10 more, of the peptide bonds (-NH-CO-) are replaced by 

pseudopeptide bonds, . which may be the same- or different. It 
is not necessary that all of the peptide bonds be replaced, 
i.e., a peptoid may include one or more conventional amino 
acid residues,' e.g., proline. 
15 A peptide bond* has two small divalent linker elements, 

-NH- and -CO-. Thus, a preferred class of psuedopeptide 
bonds are those which consist of two small divalent linker 
elements. Each may be chosen independently from the group 
consisting of amine (-NH-) , substituted amine (-NR-) f 
20 carbonyl (-CO-) , thiocarbonyl (-CS-) , methylene {-CH2-) , 

monosubstituted methylene (-CHR-) , disubstituted methylene 
(-CR1R2-), ether (-0-) and thioether {-S-) . The more * 
preferred pseudopeptide bonds include: 
N-modified -NRCO- 
25 Carba ^ -CH2-CH2- 

Depsi W -CO-0- 
Hydroxyethylene ^ -CHOH-CH2- 
Ketomethylene H -CO-CHg- 
Methylene-Oxy -CH2-O- 
30 Reduced -CHj-NH- 

Thiomethylene -CH2-S- 
Thiopeptide -CS-NH- 
Retro-Inverso -CO-NH- 

35 A single peptoid molecule may include more than one 

kind of pseudopeptide bond. 

For the purposes of introducing diversity into a 
peptoid library, one may vary (1) the side chains attached 
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to the core main chain atoms of the monomers linked by the 
pseudopeptide bonds, and/or (2) the side chains (e.g., the - 
R of an -NRCO-) of the pseudopeptide bonds. Thus, in one 
embodiment, the monomeric units which are not amino acid 
5 residues are of the structure ~NRl-CR2-CO-, where at least 
one of Rl and R2 are not hydrogen. If there is variability 
in the pseudopeptide bond, this is most conveniently done by 
using an -NRCO- or other pseudopeptide bond with an R group, 
and varying the R group. In thi.s event, the R group will 

10 usually be any of the side chains characterizing the amino 
acids of peptides, as previously discussed. 

• If the R group of. the pseudopeptide bond is not 
variable, it will usually be small, e.g., not more than 10 
atoms (e.g., hydroxyl, amino, carboxyl, methyl, ethyl, 

15 propyl). 

If the conjugation chemistries are compatible, a simple 
combinatorial library may include bpth peptides and 
peptoids . 

20 Peptide Nucleic Acid Library 

" A PNA oligomer is here defined as one comprising a 
plurality of units, at least one of which is a PNA monomer 
which comprises a side chain comprising a nucleobase. For 
nucleobases, see USP 6,077,835. 

25 The classic PNA oligomer is composed of (2- 

aminoethyl) glycine units, with nucleobases attached by 
methylene carbonyl linkers. * That is,- it has the structure 

H- (-HN-CH2-CH2-N(.-CO-CH2rB)-CH2-CO-)n -OH 

30 

where the outer parenthesized substructure is the PNA 
monomer. 

In this structure, the nucleobase B is separated from 
35 the backbone N by three bonds, and the points of attachment 
of the side chains are separated by six bonds. The 
nucleobase may be any of the bases included in the 
nucleotides discussed in connection with oligonucleotide 
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libraries. The bases of nucleotides A, C and U are 

preferred. 

A PNA oligomer may further comprise one or more amin'o 
acid residues, especially glycine and proline. 

One can readily envision related molecules in which (1) 
the -C0CH2- linker is replaced by another linker, especially 
one composed of two small divalent linkers as defined 
previously, (2) a side chain is attached to one of the three 
main chain carbons not participating in the peptide bond 
(either instead or in addition .to the. side chain attached to 
the N of the classic PNA); and/or (3) the peptide bonds are 
replaced by pseudopeptide bonds as disclosed previously in 
the context of peptoids. 

PNA oligomer libraries have been made; see e.g. Cook, 
6,204,326. 

Small Org anic Compound Library 

The small organic compound library ("compound library", 
for short) is a combinatorial library whose members are 
suitable for use as drugs if, indeed, they have the ability 
to mediate a biological activity of the target protein. 

Peptides have certain disadvantages as drugs. These 
include susceptibility to degradation by serum proteases, 
and difficulty in penetrating cell membranes. Preferably, 
all or most of the compounds of the compound library avoid, 
or at least do not suffer to the same degree, one or -more of 
the pharmaceutical disadvantages of peptides. 

In designing a compound library, it is helpful to bear 
in mind the methods of molecular modification typically used 
to obtain new drugs. Three basic kinds of modification may 
be identified: disjunction , in which a lead drug is 
simplified to identify its component pharmacophoric 
moieties; nnni unction , in which two or more known 
pharmacophoric moieties, which may be the same or different, 
are associated, covalently or noncovalently, to form a new 
drug; and alteration , in which one moiety is replaced by 
another which may be similar or different, but which is not 
in effect a disjunction or conjunction. The use of the 
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terms "disjunction", "conjunction" and "alteration" is 
intended only to connote the structural relationship of the 
end product to the original leads, and not how the new drugs 
are actually synthesized, although it is possible that the 
5 two are the same. 

The process of disjunction is illustrated by the 
evolution of neostigmine (1931) and edrophonixim (1952) from 
physostigmine (1925) . Subsequent conjunction is illustrated 
•by demecarium. (1956) and ambenonium (1956) . • • " 

10. • Alterations may modify the size, polarity, or electron 

distribution of an original moiety. Alterations include 
ring closing or opening, formation of lower or higher 
homologues, introduction or saturation of double bonds, 
introduction of optically active centers, introduction, 

15 removal or replacement of bulky groups, isosteric or 
bioisosteric substitution, changes in the. position or 
orientation of a group, introduction of alkylating groups, 
and introduction, removal or replacement of groups with a 
view toward inhibiting or promoting inductive 

20 (electrostatic) or conjugative (resonance) effects. 

Thus, the substituents may include electron acceptors 
and/or electron donors. Typical electron donors (+1) 
include -CH3, -CHjR, -CHR2, -CR3 and -COO". Typical electron 
acceptors (-1) include -NH3+, -NR3+, -NO2, -CN, -COOH, -COOR, 

25 -CHO, -COR, -COR, -F, -CI, -Br, -OH, -OR, -SH, -SR, -CH=CH2, 
-CR=CR2, and -C=CH. 

The substituents may also include those which increase 
or decrease electronic density in conjugated systems. The 
former (+R) groups include -CH3, -CR3, -F,*-C1, -Br, -I, -OH, 

30 -OR, -OCOR, -SH, -SR, -NH2, -NRj, and -NHCOR. The later (-R) 
groups include -NO2, -CN, -CHC, -COR, -COOH, -COOR, -CONH2, 
-SO2R and -CF3. 

Synthetically speaking, the modifications may be 
achieved by a variety of unit processes, including 

35 nucleophilic and electrophilic substitution, reduction and 
oxidation, addition elimination, double bond cleavage, and 
cyclization. 

For the purpose of constructing a library, a compound. 
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or a family of compounds, having one or more pharmacological 
activities (which need not be related to the known or 
suspected activities of the target protein) , may be 
disjoined into two or more .known or potential pharmacophoric 
moieties. Analogues of each of these moieties may be 
identified, and mixtures of these analogues reacted so as to 
reassemble compounds which have some similarity to the 
original lead compound. It is not necessary that all 
members of the library possess moieties analogous to all of 
the moieties of the lead compound. 

The design of a library may be illustrated by the 
example of the benzodiazepines. Several benzodiazepine 
drugs, including chlordiazepoxide, diazepam and oxazepam, 
have been used as anti-anxiety drugs. Derivatives of 
benzodiazepines have widespread biological activities; 
derivatives have been reported to act not only as 
anxiolytics, but also as anticonvulsants; cholecystokinin 
(CCK) receptor subtype A or B, kappa opioid receptor, 
platelet activating factor, and HIV transactivator Tat 
antagonists, and GPIIblla, reverse transcriptase and ras 
. farnesyltransf erase inhibitors. 

The benzodiazepine structure has been disjoined into a 
2-aminobenzophenone, an amino acid, and an alkylating agent. 
See Bunin, et al . , Proc. Nat. Acad. Sci . USA, 91:4708 
(1994) . Since only a few 2-aminobenzophenone derivatives 
are commercially available, it was later disjoined into 2- 
aminoarylstannane, an acid chloride, an amino acid, and an 
alkylating agent. Bunin, et al., Meth. Enzymol., 267:448 
(1996) . The arylstannane may be considered the core 
structure upon which the other moieties are substituted, or 
all four may be considered equals which are conjoined to 
make each library member. 

A basic library synthesis plan and member structure is 
shown in Figure 1 of Fowlkes, et al., U.S.' Serial No. 
08/740,671, incorporated by reference in its entirety. The 
acid chloride building block introduces variability at the R 
site. The site is introduced by the amino acid, and the 
R3 site by the alkylating agent. The R< site is inherent in 
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the arylstannane . Bunin, et al. generated a 1, 4- 
benzodiazepine library of 11,200 different derivatives 
prepared from 20 acid chlorides, 35 amino acids, and 16 
alkylating agents. (No diversity was introduced at R*; this 
group was used to couple the molecule to a solid phase.) 
According to the Available Chemicals Directory (HDL 
Information Systems, San Leandro CA) , over .300 acid 
chlorides, 80 Fmoc-protected amino acids and 800 alkylating 
agents were available for purchase (and more, of course, 
could be synthesized) . The particular moieties used were 
chosen to maximize structural dispersion, while limiting the 
numbers to those conveniently synthesized in the wells of a 
microtiter plate. In choosing between structurally similar 
compounds, preference was given to the least substituted 
coit^ound . 

The variable elements included both aliphatic and 
aromatic groups. Among the aliphatic groups, both acyclic 
and cyclic (mono- or poly-) structures, substituted or not, 
were tested. (While all of the acyclic groups were linear, 
it would have been feasible to introduce a branched 
aliphatic) . The aromatic groups featured either single and 
multiple rings, fused or not, substituted or not, and with 
heteroatoms or not. The secondary substitutents included - 
NHj, -OH, -OMe, -CN, -CI, -F, and -COOK. While not used, 
spacer moieties, such as -0-, -S-, -00-, -CS-, -NH-, and - 
NR-, could have been incorporated. 

Bunin et al. suggest that instead of using a 1, 4- 
benzodiazepine as a core structure, one may instead use a 1, 
4-benzodiazepine-2, 5-dione structure. 

As noted by Bunin et al., it is advantageous, although 
not necessary, to use a linkage strategy which leaves no 
trace of the linking functionality, as this permits 
construction of a more diverse library. 

Other combinatorial nonoligomeric compound libraries 
known or suggested in the art have been based on carbamates 
mercaptoacylated pyrrolidines, phenolic agents, aminimides, 
N-acylamino ethers (made from amino alcohols, aromatic 
hydroxy acids, and carboxylic acids), N-alkylamino ethers 
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(made from aromatic hydroxy acids, amino alcohols and 
aldehydes) 1, 4-piperazines, and 1, 4-piperazine-6-ones . 

DeWitt, et al., Proc. Nat. Acad. Sci. (USA), 90:6909^13 
(1993) describe the simultaneous but separate, synthesis of 
40 discrete hydantoins and 40 discrete benzodiazepines. 
They carry out their synthesis on a solid support (inside a 
gas dispersion tube) , in an array format, as opposed to 
other conventional simultaneous synthesis techniques (e.g., 
in a well, or on a pin) . The hydantoins were synthesized by 
first simultaneously deprotecting and then treating each of 
five amino acid resins -with each of eight isocyanates . The 
benzodiazepines were synthesized by treating each of five 
deprotected amino acid resins with each of eight 2-amino 
benzophenone imines. 

Chen, et al., J. Am. Chem. Soc, 116:2661-62 (1994) 
described the preparation of a pilot (9 member) 
combinatorial library, of formate esters. A polymer bead- 
bound aldehyde preparation was "split" into three aliquots, 
each reacted with one of three different ylide reagents. 
The reaction products were combined, and then divided into 
three new alicjuots, each of which was reacted with a 
different Michael donor. Compound identity was found to be 
de'terminable on a single bead basis by gas 
chromatography /mass spectroscopy analysis. 

Holmes,. USP 5,549,974 (1996) sets forth methodologies 
for the combinatorial synthesis of libraries of 
thiazolidinones and metathiazanones . These libraries are 
made by combination of amines, carbonyl compounds, and 
thiols under cyclization conditions. 

Ellman, USP 5,545,568 (1996) describes combinatorial 
synthesis of benzodiazepines, prostaglandins, beta-turn 
mimetics, and glycerol-based compounds. See also Ellman, 

USP 5,288,514. 

Summerton, USP 5,506,337 (1996) discloses methods of 
preparing a combinatorial library formed predominantly of 
morpholino subunit structures. 

Heterocylic combinatorial libraries are reviewed 
generally in Nefzi, et al., Chem. Rev., 97:449-472 (1997). 
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One or more moieties of the following types may be 
incorporated into compounds of the library, as many drugs 
fall into one or more of the following categories: 

acetals 
acids 
alcohols 
amides 
ami dines 
15 amines 

amino acids 
amino alcohols 

20 

amino ethers 
amino ketenes I 
25 ammonium compounds 

azo compounds 
enols 

30 

esters 
ethers 
35 glycosides 
guanidines 

halogenated compounds 

40 

hydrocarbons 
ketones 
45 lactams 
lactones 
mustards 

50 

nitro compounds 
nitroso compounds 
55 organo minerals 
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phenones • 

guinones 

semicarbazones 

stilbenes 

sulfonamides 

sulfones 

thiols 

thioamides 

thioureas 

ureas 

ureides 

urethans 

Without attempting to exhaustively recite all 
pharmacological classes of drugs / or all drug structures, 
one or more compounds of the chemical structures listed 
below have been found to exhibit the indicated 
pharmacological activity, and these structures, or 
derivatives, may be used as design elements in screening for 
further compounds of the same or different activity. (In 
some cases, one or more lead drugs of the class are 
indicated. ) 

hypnotics 

higher alcohols (clomethiazole) 

aldehydes (chloral hydrate) 

carbamates (meprobamate) 

acyclic ureides (acetylcarbromal) 

barbiturates (barbital ) 

benzodiazepine (diazepam) 

anticonvulsants 

barbiturates (phenobarbital ) 
hydantoins (phenytoin) 
oxazolidinediones (trimethadione) 
succinimides (phensuximide) 
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acylureides (phenacemides ) 

narcotic analgesics 
morphines 

phenylpiperidines (meperidine ) 
diphenylpropyl amines (methadone) 
phenothiazihes (methotrimeprazine) 

analgesics^ antipyretics, antirheumatics 
salicylates (acetylsalicylic acid) 
p-aminophenol ( acetaminophen ) 
5 -py r a zol one ( dipy r one ) 

3 f 5-pyrazolidinedione (phenylbutazone ) 
arylacetic acid (indomethacin) 

adrenocortical steroids (cortisone, dexamethasone, 
prednisone, triamcilone) 
athranilic . acids 

neuroleptics 

phenothiazine (chlorpromazine) 
thioxanthene ( chlorprothixene ) 
reserpine 

butyrophenone (halopendol) 

anxiolytics 

propandiol carbamates (meprobamate) 
benzodiazepines ' (chlordiazepoxide, diazepam, 

oxazepam) 

antidipressants 

tricyclics (imipramine) 

muscle/relaxant s 

propanediols and carbamates (mephenesin) 

CNS stimulants 

xanthines (caffeine, theophylline) 
phenylalkylamines (amphetamine) 
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(Fenetylllne is a conjunction of theophylline and 
amphetamine) 

oxazolldlnones (pemoline) 
cholinergics 

choline esters (acetylcholine) 
N-dlmethylcarbamates 

adrenergics 

aromatic amines ( epinephrine / . Isoproterenol, 

phenylephrine) 
allcycllc ainlnes (cyclopent amine) 
aliphatic amines (methylhexaneamlne ) 
Imlda zol Ine s ( napha z ol ine ) 

antl-adrenerglcs 

Indolethylamlne alkaloids (dlhydroergotamlne) 
Imidazoles (tolazollne) 
benzodloxans (plperoxan) 
beta-haloalkylamlnes (phenoxybenzamlne) 
dlbenzazeplnes (azapetlne) 
hydrazlnophthalazlnes (hydralazine) 

antihistamines 

ethanolamines ( diphenhydramine ) 
ethylenedlamines ( tripelennomine) 
alkylamines (chlorpheniramine) 
piperazines (cyclizine) 
phenothiazines (promethazine) 

local anesthetics 
benzoic acid 

esters (procaine, Isobucalne, cyclomethycalne) 
basic amides (dlbucaine) 

anllldes, toluldldes, 2, 6-xylldldes (lldocalne) 
tertiary amides (oxetacalne) 



vasodilators 

polyol nitrates (nitroglycerin) 
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10 



15 



20 



25 



30 



diuretics 

xanthines 

thiazides (chlorothiazide) 
sulfonamides (chlorthalidone) 

antihelmintics 

cyanine dyes 

antimalarials 

4-aminoquinolines 

8-aminoquinolines 

pyrimidines 

biguanides 

acridines 

dihydrotriazines 

sulfonamides 

sulf ones 

antibacterials 

antibiotics 
penicillins 
cephalosporins 

octahydronapthacenes (tetracycline) 
sulfonamides 
nitrofurans 
cyclic amines 
naphthyridines 
xylenols 

antitumor 

alkylating agents 
nitrogen mustards 
aziridines 

methane sulfonate esters 

epoxides 
amino acid antagonists 
folic acid antagonists 
pyrimidine antagonists 



56 



PCT/US2003/0240S3 



z antagonists 

.itanes 
.-•sides 

^micarbazones 
:.es 

.es and guanidines 
Inclines 

idazoles 
-:2ines . 

rological classes, see, e.g., Goth, Medical 
■-" inciples and Concepts (C.V, Mosby Co.: 8th 
Dcovas and Burckhalter, Essentials of 
; try (John Wiley & Sons, Inc. : 1976) . For 
:.3, see, e.g.^ Warren, Organic Synthesis: The 
-/oroach (John Wiley & Sons, Ltd.: 1982); 
' of Organic Compounds (John Wiley & Sons: 
:1 Payne, How to do an Organic Synthesis 
1, Inc.: 1969); Greene, Protective Groups in 
v5 (Wiley-Interscience) . For selection of 
-.e e.g., Hansch and Leo, Substituent 
r rrelation Analysis in Chemistry and Biology 
.as: 1979) . 

; is preferably synthesized so that the 
^rs remain identifiable so that, if a member 
\ctive, it is not necessary to analyze it. 
of identification have been proposed, 

Ing, i.e., the attachment to each member of 

^ntifier moiety which is more readily 

.fied than the member proper. This has the 

vantage that the tag may itself influence the 

.ty of the conjugate. 

.1 addressing, e.g., each member is 

-.sized only at a particular coordinate on or 

:atrix, or in a particular chamber. This 
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might be, for example, the location of a 
particular pin, or a particular well on a 
microtiter plate, or inside a "tea bag". 
The present invention is not limited to any particular form 
of identif ication. 

However, it is possible to simply characterize those 
members of the library which are found to be active, .based 
on the characteristic spectroscopic indicia of the various 
building blocks. 

• Solid phase synthesis permits greater control over • 
which derivatives are formed. However, the solid phase 
could interfere with activity. To overcome this problem, 
jsome or all of the molecules of each member could be 
liberated, after synthesis but before screening. 

Examples of candidate simple libraries which might .be 
evaluated include derivatives of the following: 
Cyclic Compounds Containing One Hetero Atom 
Heteronitrogen 
pyrroles 

pentasubstituted pyrroles 
pyrrolidines 
pyrrolines 
prolines 
indoles 

beta-carbolines 
pyridines 

dihydropyridines 

1, 4 -dihydropyridines 

pyrido [2, 3-d] pyrimidines 

tetrahydro-3H-imidazo [ 4 , 5-c] pyridines 
Isoquinolines 

tetrahydroisoquinolines 
guinolones 
beta-lactams 

a2abicyclo[4 .3.0]nonen-8-one amino acid 
Heterooxygen 
furans 

tetrahydrof urans 
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2, 5-disubstituted tetrahydrof urans 

pyrans 

hydroxypyranones 

tetrahydroxypyranones 
gamma -butyrolactones 
Heterosulfur 

sulf olenes 

Cyclic Compounds with. Two or More Hetero atoms 
Multiple heteronitrogens 
imidazoles 
pyrazoles 
piperazines 

diketopiperazines 

arylpiperazines 

benzylpiperazines 
benzodiazepines 
1 , 4"benzodiazepine-2 , 5-diones 
hydantoins 

5-alkoxyhydantoins 
dihydropyrimi dines 

1, 3-disubstituted-5, 6-dihydopyrimidine-2, 4- 

diones 
cyclic ureas 
cyclic thioureas 
quinazolines 

chiral 3-substituted-quinazoline-2 , 4- 

diones 

triazoles 

1, 2, 3-triazoles 
purines 

Heteronitrogen and Heterooxygen 

di kelomorphol ine s 

isoxazoles 

isoxazolines 
Heteronitrogen and Heterosulfur 

thiazolidines 

N-axylthiazolidines 
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dihydr ot hi a 2 ol e s 

2-inethylene-2 , 3~dihydrothiazates 

2-aminothia2oles 
thiophenes 

5 3-amino thiophenes 

4-thiazolidinones 
4-nielathiazanones 
benzisothiazolones 
For details on synthesis. of libraries, see Nefzi, et 
10 al., Chem. Rev., 97:449-72 (1997.), and references cited 
therein. 

Pharmaceu-bical Methods and * Preparations 

The preferred animal subject of the present invention 

15 is a maminal. By the term "maitmial" is meant an individual • 
belonging to the class Mammalia. The invention is 
particularly useful in the treatment of human subjects, 
although it is intended for veterinary and nutritional uses 
as well. Preferred nonhuman subjects are of the orders 

20 Primata (e.g., apes and monkeys), Artiodactyla or 

Perissodactyla (e.g., cows, pigs, sheep, horses, goats), 
•Carnivora (e.g., cats, dogs), Rodenta (e.g., rats, mice, 
.guinea pigs, hamsters), Lagomorpha (e.g., rabbits) or other 
pet, farm or laboratory mammals. 

25 The term "protection", as used herein, is intended to 

include "prevention," "suppression" and "treatment." 
"Prevention", strictly speaking, involves administration of 
the pharmaceutical prior to the ind uction of the disease, (or 
other adverse clinical condition) . "Suppression" involves 

30 administration of the composition prior to the clinical 
appearance of the disease. "Treatment" involves 
administration of the protective composition after the 
appearance of the disease. 

It will be understood that in hximan and veterinary 

35 medicine, it is not always possible to distinguish between 
"preventing" and "suppressing" since the ultimate inductive 
event or events may be unknown, latent, or the patient is 
not ascertained until well after the occurrence of the event 
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or events • Therefore, unless qualified, the term 
"prevention" will be understood to refer to both prevention 
in the strict sense, and to suppression. 

The preventative or prophylactic use of a 
pharmaceutical .involves identifying subjects who are at 
higher risk than the general population of contracting^ the 
disease, and administering the pharmaceutical to them in 
advance of the clinical appearance of the disease. The 
effectiveness of such use is measurfed by comparing the ' 
subsequent incidence or severity of the disease, or" of 
particular symptoms of the disease, in the treated subjects 
against that in untreated subjects of the same high risk 
group . 

While high risk factors vary from disease to disease, 
in general, these include (1) prior occurrence of the 
disease in one or more members of the same family, or, in 
the case of a contagious disease, in individuals with whom 
the subject has come into potentially contagious contact at 
a time when the earlier victim was likely to be contagious, 
(2) a prior occurrence of the disease in the subject, (3) 
prior occurrence of a related disease, or a condition known 
to increase the likelihood of the disease, in the subject; 
(4) appearance of a suspicious level of a marker of the 
disease, or a related disease or condition; (5) a subject 
who is immunologically compromised, e.g., by radiation 
treatment, HIV infection, drug use,, etc,, or (6) membership 
in a particular group (e.g., a particular age, sex, race, 
ethnic group, etc.) which has been epidemiologically 
associated with that disease. 

A prophylaxis or treatment may be curative, that is, 
directed at the underlying cause of a disease, or 
ameliorative, that is, directed at the symptoms of the 
disease, especially those which reduce the quality of life. 

It should also be understood that to be useful, the 
protection provided need not be absolute, provided that it 
is sufficient to carry clinical value. An agent which 
provides protection- to a lesser degree than do competitive 
agents may still be of value if the other agents are 
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ineffective for a particular individual, if it can be used 
in combination with other agents to enhance the level of 
protection, or if it is safer than competitive agents. It is 
desirable that there be .a statistically significant (p=0.05 
or less) improvement in the treated subject relative to an 
appropriate untreated control, and it is desirable that this 
improvement be at least 10%, more preferably at least 25%, 
still more preferably at least 50%, .even more preferably at 
least 100%, in some- indicia of the incidence or severity of 
the disease or of at least one symptom of the disease. 
. • • At least one of the drugs of the present invention may 
be administered, by any means that achieve their intended 
purpose/ to protect a subject against a disease or other 
adverse condition. The form of administration may be 
systemic or topical. For example, administration of such a 
composition may be by various parenteral routes such as 
subcutaneous, intravenous, intradermal, intramuscular, 
intraperitoneal, intranasal, transdermal, or buccal routes. 
Alternatively, or concurrently, administration may be by the 
oral route. Parenteral administration can be by bolus 
injection or by gradual perfusion over time. 

A typical regimen comprises administration of an 
effective amount of the drug, administered over a period 
ranging from a single dose, to dosing over a period of 
hours, days, weeks, months, or years. 

It is understood that the suitable dosage of a drug of 
the present invention will be dependent upon the age, sex, 
health, and weight of the recipient, kind of concurrent 
treatment, if any, frequency of treatment, and the nature of 
the effect desired. However, the most preferred dosage can 
be tailored to the individual subject, as is understood and 
determinable by one of skill in the art, without undue 
experimentation. This will typically involve adjustment of 
a standard dose, e.g., reduction of the dose if the patient 
has a low body weight. 

Prior to use in humans, a drug will first be evaluated 
for safety and efficacy in laboratory animals. In human 
clinical studies, one would begin with a dose expected to be 
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safe in humans, based on the preclinical data for the drug 
in question, and on customary doses for analogous drugs (if 
any) . If this dose is effective, the dosage may be 
decreased, to determine the minimum effective dose, if 
desired. If this dose is ineffective, it will be cautiously 
increased, with the patients monitored for signs of side 
effects. See, e.g., Berkow et al, eds . , The Merck Manual, 
15th- edition, Merck and Co., Rahway, N.J., 1987; Goodman et 
al., eds., Goodman and Gllman^s The Pharmacologioal Basis of 
TherapeuticSf 8th edition, Pergamon Press, Inc., Elfflaford, 
N.Y., (1990); Avery's Drug Treatment: Principles and 
Practice of Clinical Pharmacology and Therapeutics/ 3rd 
edition, ADIS Press, LTD*, Williams and Wilkins, Baltimore, 
MD. (1987), Ebadi, Pharmacology^ Little, Brown and Co., 
Boston, (1985) / which references and references cited 
therein, are entirely incorporated herein by reference. 

The total dose required for each treatment may be 
administered by multiple doses or in a single dose. The 
protein may be administered alone or in conjunction with 
other therapeutics directed to the disease or directed to 
other symptoms thereof. 

The appropriate dosage form will depend on the disease, 
the pharmaceutical, and the mode of administration; 
possibilities include tablets, capsules, lozenges, dental 
pastes, suppositories, inhalants, solutions, ointments, and 
parenteral depots. See, e.g., Berker, supra, Goodman, 
supra^ Avery, supra and Ebadi, supra, which are entirely 
incorporated herein by reference, including all references 
cited therein. 

In the case of peptide drugs, the drug may be 
administered in the form of an expression vector comprising 
a nucleic acid encoding the peptide; such a vector, after in 
corporation into the genetic complement of a cell of the 
patient, directs synthesis of the peptide. Suitable vectors 
include genetically engineered poxviruses (vaccinia) , 
adenoviruses, adeno-associated viruses, herpesviruses and 
lentiviruses which are or have been rendered nonpathogenic. 

In addition to at least one drug as described herein, a 
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pharmaceutical composition may contain suitable 
pharmaceutically acceptable carriers, such as excipients, 
carriers and/or auxiliaries which facilitate processing of 
the active compounds into preparations which can be used 
pharmaceutically. See, e.g., Berker/ supra, Goodman, supra, 
Avery, supra and Ebadi, supra, which are entirely 
incorporated herein by reference, included all references 
cited therein. 

Assay Compos I'bxons and He'bhods 

Target Organism 

The ifivention contemplates that it may be appropriate 
to ascertain or to mediate the biological activity of a 
substance of this invention in a target organism. 

The target organism may be a plant, animal, or 
micr oor gani sm . 

In the case of a plant, it may be an economic plant, in 
which case the drug may be intended to increase the disease, 
weather or pest resistance, alter the growth 
characteristics, or otherwise improve the useful 
characteristics or mute undesirable characteristics of the 
plant. Or it may be a weed, in which case the drug may be 
intended to kill or otherwise inhibit the growth of the 
plant, or to alter its characteristics to convert it from a 
weed to an economic plant. The plant may be a tree, shrub, 
crop, grass, etc. The plant may be an algae (which are in 
some cases also microorganisms), or a vascular plant, 
especially gymnosperms (particularly conifers) and 
angiosperms. Angiosperms may be monocots or dicots. .The 
plants of greatest interest are rice, wheat, corn, alfalfa, 
soybeans, potatoes, peanuts, tomatoes, melons, apples, 
pears, plums, pineapples, fir, spruce, pine, cedar, and oak. 

If the target organism is a microorganism, it may be 
algae, bacteria, fungi, or a virus (although the biological 
activity of a virus must be determined in a virus-infected 
cell) . The microorganism may be human or other animal or 
plant pathogen, or it may be nonpathogenic. It may be a 
soil or water organism, or one which normally lives inside 
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other living things. 

If the target organism is an animal^ it may be a 
vertebrate or a nonvertebrate animal. Nonvertebrate animals 
are chiefly of interest when they act as pathogens or 
5 parasites, and the drugs are intended to act as biocidic or 
biostatic agents. Nonvertebrate animals of interest include 
worms, mollusks, and arthropods. 

The target organism may also be a vertebrate animal, 
i.e., a mammal, bird, reptile, fish or amphibian. Among 

10 mammals, the target animal preferably belongs to the order 
Primata (humans, apes and monkeys), Artiodactyla (e.g., 
•cows, pigs, sheep/ goats, horses), Rodenta (e.g./ mice, 
rats) Lagomorpha (e.g., rabbits., hares), or Carnivora. (e.g., 
cats, dogs) . Among birds, the target animals are preferably 

15 of the orders Anseriformes (e.g., ducks, geese, swans) or 
Galliformes . (e.g.,. quails, grouse, pheasants, turkeys and 
chickens). Among fish, the target animal is preferably of 
the order Clupeiformes (e.g., sardines, shad, anchovies, 
whitefish, salmon) - 

20 

Target Tissues 

The term "target tissue" refers to any whole animal, 
physiological system, whole organ, part of organ, 
miscellaneous tissue, cell, or cell component (e.g.,. the 

25 cell membrane) of a target animal in which biological 
activity may be measured. 

Routinely in mammals one would choose to compare and 
contrast the biological impact on virtually any and all 
tissues which express the subject receptor protein. The 

30 main tissues to use are: brain, heart, lung, kidney, liver, 
pancreas, skin, intestines, adipose, stomach, skeletal 
muscle, adrenal glands, breast, prostate, vasculature, 
retina, cornea, thyroid gland, parathyroid glands, thymus, 
bone marrow, bone, etc. 

35 Another classification would be by cell type: B cells, 

T cells, macrophages, neutrophils, eosinophils, mast cells, 
platelets, megakaryocytes, erythrocytes, bone marrow stomal 
cells, fibroblasts, neurons, astrocytes, neuroglia. 
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microglia, epithelial cells (from any organ, e.g. skin, 
breast, prostate, lung, intestines etc) , cardiac niuscle 
cells, smooth muscle cells, striated muscle cells, 
osteoblasts , osteocytes , chondroblasts , chondrocytes , 
keratinocytes, melanocytes, etc. 

Of course, in the case of a unicellular organism, there 
is no distinction between the "target organism" and the 
"target tissue". 

Screening Assays 

Assays intended to determine the binding or the 
biological activity of a substance axe called preliminary 
screening assays. 

. Screening assays will typically be either in vitro 
.(cell-free) assays (for binding to an immobilized receptor) 
or cell-based assays (for alterations in the phenotype of - 
the cell) . They will not involve screening of whole 
multicellular organisms, or isolated ' organs . The comments 
on diagnostic biological assays apply mutatis mutandis to 
screening cell-based assays. 

• In Vitro vs. In Vivo Assavs 

The term in vivo is descriptive of an event, such as 
binding or enzymatic action, which occurs within a living 
organism. The organism in question may, however, be 
genetically modified. The term in ^ritro refers to an .event 
which occurs outside a living organism. Parts of an 
organism (e.g., a membrane, or an isolated biochemical) are 
used, together with artificial substrates and/or conditions. 
For the purpose of the present invention, the term in vitro 
excludes events occurring inside or on an intact cell, 
whether of a unicellular or multicellular organism. 

In vivo assays include both cell-based assays, and 
organismic assays. The cell-based assays include both assays 
on unicellular organisms, and assays on isolated cells or 
cell cultures derived from multicellular organisms. The 
cell cultures may be mixed, provided that they are not 
organized into tissues or organs. The term organismic assay 
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refers to assays on whole multicellular organisms, and 
assays on isolated organs, or tissues of such organisms. 

In vitro D iagnostic Methods and Reagents 

5 

The in vitro assays of the present invention may be 
applied to any suitable analyte-containing sample, and may 
be qualitative or quantitative in nature.. 

10 Sample 

The .sample will no.rmally be a biological fluid, such as 
blood, urine, lymph, semen, milk, or cerebrospinal fluid, or 
a fraction or derivative thereof, or a bioiogical tissue, in 
the. form of, e.g., a tissue section or homogenate . However, 

15 the sample conceivably could be (or derived from) a food or 
• beverage, a pharmaceutical or diagnostic composition, soil, 
or surface or ground water- If a biological fluid or 
tissue, it may be taken from a human or other mammal, 
vertebrate or animal, or from a plant. The preferred sample 

20 is blood, or a fraction or derivative thereof. 

Binding and Reaction Assays 

The assay may be a binding assay, in which one step 
involves the binding of a diagnostic reagent to the analyte, 

25 . or a reaction assay, which involves the reaction of a 

reagent with the analyte. The reagents used in a binding 
assay may be classified as to the nature of their 
interaction with analyte: (1) analyte analogues, or (2) 
analyte binding molecules (ABM) . They may be labeled or 

30 insolubilized. 

In a reaction assay, the assay may look for a direct 
reaction between the analyte and a reagent which is reactive 
with the analyte, or if the analyte is an enzyme or enzyme 
inhibitor, for a reaction catalyzed or inhibited by the 

35 analyte. The reagent may be a reactant, a catalyst, or an 
inhibitor for the reaction. 

An assay may involve a cascade of steps in which the 
product of one step acts as the target for the next step. 
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These steps may be binding steps ^ reaction steps ^ or a 
combination thereof. 

Sigrnal Producing- System (SPS) 

5 

In order to detect the presence, or measure the amount, 
of an analyte, the assay must provide for a signal producing 
system (SPS) in which there. is a detectable difference in 
the signal produced, depending on whether the analyte is 

10 present or absent (or, in a quantitative assay, on the 

amount of the analyte) . The detectable signal may be one 
which is visually detectable, or one detectable only with 
instruments. Possible signals ■ include production of colored 
or luminescent products, alteration of the characteristics 

15 (including amplitude or polarization) of absorption or 

emission of radiation by an assay component or product, and 
precipitation or agglutination of a component or product. 
The term "signal" is intended to include the discontinuance 
of an existing signal, or a change in the rate of change of 

20 an observable parameter, rather than a change in its 

absolute value p The signal may be monitored manually or 
automatically. 

In a reaction assay, the signal is often a product of 
the reaction. In a binding assay, it is normally provided 

25 by a label borne by a labeled reagent. 

Labels 

The component of the signal producing system which is 
most intimately associated with the diagnostic reagent. is 

30 called the "label". A label may be, e.g., a radioisotope, a 
fluorophore, an enzyme, a co-enzyme, an enzyme substrate, an 
electron-dense compound, an agglutinable particle. 

The radioactive isotope can be detected by such means 
as the use of a gamma counter or a scintillation counter or 

35 by autoradiography. Isotopes which are particularly useful 
for the purpose of the present invention include ^H, ^^^I, 
"^I, ^^S, "C, and ^^P. "^I is preferred for antibody 
labeling. 
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The label may also be a flubrophore. When the 
fluorescently labeled reagent is exposed to light of the 
proper wave length, its presence can then be detected due to 
fluorescence. Among the most commonly used fluorescent 
5 labelling compounds are fluorescein isothiocyanate/^ 

rhodamine, phycoerythrin, phycocyanin^ allophycocyanin,. Qr 
phthaldehyde and f luorescamine . 

Alternatively, fluorescence-emitting metals such as 
"^Eu, or others of the lanthanide series, may be incorporated 

10 into a diagnostic reagent using such metal chelating groups 
•as diethylenetriaminepentaacetic acid (DTPA) of 
ethylenediamine-tetraacetic acid (EDTA)". 

The label may also, be a chemiluminescent compound.. The 
presence of the chemiliaminescently labeled reagent is then 

15 determined by detecting the presence of luminescence that 
arises during the course of a chemical reaction. Examples 
of particularly useful chemiluminescent labeling compounds 
are luminol, isolumino, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

20 Likewise, a bioluminescent compound may be used for 

labeling. Bioluminescence is a type of chemiluminescence 
found in biological systems in which a catalytic protein 
increases the efficiency of the chemiluminescent reaction. 
The presence of a bioluminescent protein is determined by 

25 detecting the presence of luminescence. Important 

bioluminescent compounds for purposes of labeling are 
luciferin, lucif erase and aequorin. 

Enzyme labels, such as horseradish peroxidase and 
alkaline phosphatase, are preferred. When an enzyme label 

30 is used, the signal producing system must also include a 

substrate for the enzyme. If the enzymatic reaction product 
is not itself detectable, the SPS will include one or more 
additional reactants so that a detectable product appears. 

An enzyme analyte may act as its own label if an enzyme 

35 inhibitor is used as a diagnostic reagent. 

Binding Assay Foirmats 

Binding assays may be divided into two basic types. 
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heterogeneous and homogeneous. In heterogeneous assays^ the 
interaction between the affinity molecule and the analyte 
does not affect the label, hence, to determine the amount or 
presence of analyte, bound label must be separated from free 
5 label: In homogeneous assays, the interaction does affect, 
the activity of the label,, and therefore analyte levels can 
be deduced without the need for a separation step. 

In one embodiment, the ABM is insolubilized by coupling 
it to a macromolecular. support, and analyte in the sample is 

10 allowed to compete with a known quantity of a labeled or 
specifically labelable analyte analogue. The "analyte . 
analogue" is a molecule capable of competing with analyte 
•for binding to the ABM, and the term is intended to include 
analyte itself. It may be labeled already, or it may be 

15 labeled subsequently by specifically binding the label to a 
moiety differentiating the analyte analogue from analyte. 
The solid and liquid phases are separated, and the labeled 
analyte analogue in one phase is quantified. The higher the 
level of analyte analogue in the solid phase, i.e., 

20 sticking to the ABM, the lower the level of analyte in the 
sample. 

In a "sandwich assay", both an insolubilized ABM, and a 
labeled ABM are employed. The analyte is captured by the 
insolubilized ABM and is tagged by the labeled ABM, forming 
25 a ternary complex. The reagents may be added to the sample 
in either order, or simultaneously. The ABMs may be the 
same or different. The amount of labeled ABM in the ternary 
•complex is directly proportional to the amount of analyte in 
the sample. 

30 The two embodiments described above are both 

heterogeneous assays. However, homogeneous assays are 
conceivable. The key is that the label be affected by 
whether or not the complex is formed. 
Conjugation Methods 
35 A label may be conjugated, directly or indirectly 

(e.g., through a labeled anti-ABM antibody), covalently 
(e,g., with SPDP) or noncovalently, to the ABM, to produce a 
diagnostic reagent. Similarly, the ABM may be conjugated to 
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a solid phase support to form a solid phase ("capture") 
diagnostic reagent. 

Suitable supports include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, 
natural and modified celluloses, polyacrylamides, agaroses, 
and magnetite. The nature of the carrier can be either, 
soluble to some extent or insoluble for the purposes of the 
present invention. 

The support material may have virtually any possible 
.structural configuration so long as the coupled molecule is 
capable of binding to its target. Thus the support 
configuration may be spherical, as in a bead, or 
cylindrical, as in the inside surface of a test tube, or th 
external surface of a rod. Alternatively, the surface may 
be flat such as a sheet, test strip, etc. 



Riolooical Assays. 

A biological assay measures or detects a biological 
response of a biological entity to a substance. 

The biological entity may be a whole organism, an 
isolated organ or tissue, freshly isolated cells, an 
immortalized cell line, or a subcellular component (such as 
a membrane; this term should not be construed as including 
an isolated receptor) . The entity may be, or may be derived 
from, an organism which occurs in nature, or which is 
modified in some way. Modifications may be genetic 
(including radiation and chemical mutants, and genetic 
engineering) or somatic . (e.g. , surgical, chemical, etc.). 
In the case of a multicellular entity, the modifications may 
affect some or all cells. The entity need not be the target 
organism, or a derivative thereof, if there is a reasonable 
correlation between bioassay activity in the assay entity 
and biological activity in the target y organism. 

The entity is placed in a particular environment, which 
may be more or less natural. For exart^le, a culture medium 
may, but need not, contain serum or serum substitutes, and 
it may, but need not, include a support matrix of some kind, 
it may be still, or agitated. It may contain particular 
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biological or chemical agents, or have particular physical 

parameters (e.g., temperature), that are intended to nourish 

or challenge the biological entity. 

There must also be a detectable biological marker for 

the response. " At the cellular level, the most common 

markers are cell survival and proliferation, cell behavior 
(clustering, motility), cell morphology (shape, color), and 

biochemical activity (overall DNA synthesis, overall protein 
synthesis, and specific metabolic activities, such as 

utilization of particular nutrients, e.g., consumption of 
oxygen, production of COj, production of organic acids, 
uptake or discharge of .ions) . 

The direct signal produced by the biological marker may 

be transformed by a signal producing system into a different- 
signal which is more observable, for example, a fluorescent 
or colorimetrJLc signal. 

The entity, environment, marker and signal producing 
system are chosen to achieve a clinically acceptable level 
of sensitivity, specificity and accuracy. 

In some cases, the goal will be to identify substances 
which mediate the biological activity of a natural 
biological entity, and the assay is carried out directly 
with that entity. In other cases, the biological entity is 
used simply as a model of some more complex (or otherwise 
inconvenient to work with) biological entity. In that 
event, the model biological entity is used because activity 
in the model system is considered more predictive of 
activity in the ' ultimate natural biological entity than is 
simple binding activity in an in vitro system. The model 
entity is used instead of the ultimate entity because the 
former is more expensive or slower to work with, or because 
ethical considerations forbid working with the ultimate 
entity yet. 

The mo.del entity may be naturally occurring, if the 
model entity usefully models the ultimate entity under some 
conditions. Or it may be non-naturally occurring, with 
modifications that increase its resemblance to the ultimate 
entity. 
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Transgenic animals, such as transgenic mice, rats, and 
rabbits, have been found useful as model systems. 

In cell-based model assays*, where the biological 
activity is mediated by binding to a receptor (target 
protein) , the receptor may be functionally connected to a 
signal (biological marker) producing system, which may be 
endogenous or exogenous to the cell. 
There are a number of techniques of doing this • 

"Zero-Hybrid" Systems 

In these systems, the binding of a peptide to the 
target proteiA results in a screenable or selectable 
phenotypic change, without resort -to fusing the target 
.protein (or a ligand binding moiety thereof) to an 
endogenous protein. It may be that the target protein is 
endogenous to the host cell, or is substantially identical ♦ 
to an endogenous receptor so that it can take advantage of ' 
the latter 's native signal transduction pathway. Or 
sufficient elements of the signal transduction pathway 
normally associated with the target protein may be 
engineered into the cell so that the cell signals binding to 
the target protein. 



"One-Hybrid" Systems 

In these systems, a chimera receptor, a hybrid of the 
target protein and an endogenous receptor, is used. The 
chimeric receptor has the ligand binding characteristics of 
the target protein and the signal transduction 
characteristics of the endogenous receptor. Thus, the 
normal signal transduction pathway of the endogenous 
receptor is subverted. 

Preferably, the endogenous receptor is inactivated, or 
the conditions of the assay avoid activation of the 
endogenous receptor, to improve the signal-to-noise ratio. 

See Fowlkes USP 5,789,184 for a yeast system. 

Another type of "one-hybrid" system combines a peptide: 
DNA-binding domain fusion with an unfused target receptor 
that possesses an activation domain. 



y/O 2004/013289 



73 



>CT/US2003/024053 



"Two-Hybrid" System 

In a preferred embodiment/ the cell-based assay is a 
two hybrid system.. This term implies that the ligand is 
incorporated into a first hybrid protein, and the receptor 
into a second hybrid protein. The first hybrid also 
comprises component A of a signal generating system, apd the 
second hybrid comprises component B of that system. 
Components A and B, by themselves, are insufficient to 
generate a signal. However, if the ligand binds the 
receptor, components A and B are brought into sufficiently • 
close proximity, so that they can cooperate to generate a 
signal. 

Components A and B may naturally occur, or be 
substantially identical to moieties which naturally occur, 
as components of a single naturally occurring biomolecule, 
or they may naturally occur, or be substantially identical 
to moieties which naturally occur, as separate naturally 
occurring biomolecules which interact in nature. 

Two-Hybrid System: Transcription Factor Type 

In a preferred "two-hybrid" embodiment, one member- of a 
peptide ligand: receptor binding pair is expressed as a 
fusion to a DNA-binding domain (DBD) from a transcription 
factor (this fusion protein is called the "bait"), and the 
other is expressed as a fusion to a transactivation domain 
.(TAD) (this fusion- protein is called the "fish", the "prey", 
or the "catch") . The transactivation domain should be 
complementary to the DNArbinding domain, i.e., it should 
interact with the latter so as to activate transcription of 
a specially designed reporter gene that carries a binding 
site for the DNA-binding domain. Naturally, the two fusion 
proteins must likewise be complementary. 

This complementarity may be achieved by use of the 
complementary and separable DNA-binding and transcriptional 
activator domains of a single transcriptional activator 
protein, or one may use complementary domains derived from 
different proteins. The domains may be identical to the 
native domains, or mutants thereof. The assay members may 
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be fused directly to the DBD or TAD, or fused through an 
intermediated linker. 

The target DNA operator may. be the native operator 
sequence, or a mutant operator. Mutations in the operator 
may be coordinated with mutations. in the DBD and the TAD, 
An example of a suitable transcription activation system is 
one comprising the DNA-binding domain from the bacterial 
repressor LexA and the activatioii domain from the yeast 
transcription factor Gal4, with the reporter gene operably 
linked to the LexA operator. 

It is not necessary to employ the intact target 
receptor; just the ligand-binding moiety is sufficient. 

The two fusion proteins may be expressed from the same 
or different vectors. Likewise, the activatable reporter 
.gene may be expressed from the same vector as either fusion 
protein (or both proteins), or from a third vector. 

Potential DNA-binding domains include Gal 4, LexA, and 
mutant domains substantially identical to the above. 

Potential activation domains include E. coli B42, Gal4 
activation domain II, and HSV VP16, and mutant domains 
substantially identical to the above. 

Potential operators include the native Operators for 
the desired activation domain, and mutant domains 
substantially identical to the native operator. 

The fusion proteins may comprise nuclear localization 
signals. 

The assay system will include a signal producing 
system, too. The first element of this system is a reporter . 
gene operably linked to an operator responsive to the DBD 
and TAD of choice. The expression of this reporter gene 
will result, directly or indirectly, in a selectable or 
screenable phenotype {the signal) . The signal producing 
system may include, besides the reporter gene, additional 
genetic or biochemical elements which cooperate in the 
production of the signal. Such an element could be, for 
example, a selective agent in the cell growth medium. There 
may be more than one signal producing system, and the system 
may include more than one reporter gene. 
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The sensitivity of the system may be adjusted by^ e.g., 
use of competitive inhibitors of any step in the activation 
or signal production process,, increasing or decreasing the 
number of operators, using a stronger or weaker DBD or TAD, 
etc. 

When the signal is the death or survival of the cell in 
question, or proliferation or nonprolif eration of the cell 
in question, the assay is said to be* a selection. When the 
.signal merely results in .a detectable phenotype by which the 
signalling cell may be differentiated from the same cell in 
a nonsignalling state (either way being a living cell), the 
assay is a screen. However; the term "screening assay" may 
be used in a broader sense to include a selection. When the 
narrower sense is intended, we will use the term 
"nonselective screen". 

Various screening and selection systems ar.e discussed 
in Ladner, USP 5,198,346. 

Screening and selection may be for or against the 
peptide: target protein or compound : target protein 
interaction . 

Preferred assay cells are microbial (bacterial, yeast, 
algal, protozooal), invertebrate, vertebrate (esp. 
mammalian, particularly human) . The best developed two- 
hybrid assays are yeast and mammalian systems. 

Normally, two hybrid assays are used to determine 
whether a protein X and a protein Y interact, by virtue of 
their ability to reconstitute the interaction of the DBD and 
the TAD. However, augmented two-hybrid assays have been 
used to detect interactions that depend on a third, non- 
protein ligand. 

For more guidance on two-hybrid assays, see Brent and 
Finley, Jr., Ann. Rev. Genet., 31:663-704 (1997); Fremont- 
Racine, et al.. Nature Genetics, 277-281 (16 July 1997); 
Allen, et al., TIBS, 511-16 (Dec. 1995); LeCrenier, et al., 
BioEssays, 20:1-6 (1998); Xu, et al . , Proc. Nat. Acad, sci . 
(USA), 94:12473-8 (Nov. 1992); Esotak, et al . , Mol . Cell. 
Biol., 15:5820-9 (1995); Yang, et al . , Nucleic Acids Res., 
23:1152-6 (1995); Bendixen, et al . , Nucleic Acids Res., 
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22:1778-9 (1994); Fuller, et al . , BioTechniques, 25:85-92 
(July 1998); Cohen, et al . , PNAS (USA) 95:14272-7 (1998); 
Kolonin and Finley, Jr., PNAS (USA) 95:14266-71 (1998). See 
also Vasavada, et al., PNAS (USA) ,• 88 : 10686-90 (1991) 
(contingent replication assay), and Rehrauer, et al., J. 
Biol. Chem., 271:23865-73 91996) (LexA repressor cleavage 
assay) . 

Two-Hybrid Systems: reporter Enzyme type 

In another embodiment, the components A* and B • 
.reconstitute an enzyme whicph is not a transcription factor - 

As in the last example-, the effect of the 
reconstitution of the enzyme is a phenotypic change which 
may be a screenable change, a selectable change, or both. 

In vivo Diagnostic Uses 

Radio-labelled ABM may be administered to the human or 
animal subject. Administration is typically by injection, 
e.g., intravenous or arterial or other means of 
administration in a quantity sufficient to permit subsequent 
dynamic and/or static imaging using suitable radio-detecting 
devices. The dosage is the smallest amount capable of 
providing a diagnostically effective image, and may be 
determined by means conventional in' the art, using known 
radio-imaging agents as a guide. 

Typically, the imaging is carried out on the whole body 
of the subject, or on that portion of the body or organ 
relevant to the condition or disease under study. The 
amount of radio-labelled ABM accumulated at a given point in 
time in relevant target organs can then be quantified. 

A particularly suitable radio-detecting device is a 
scintillation camera, such as a gamma camera. A 
scintillation camera is a stationary device that can be used 
to image distribution of radio-labelled ABM. The detection 
device in the camera senses the radioactive decay, the 
distribution of which can be recorded. Data produced by the 
imaging system can be digitized. The digitized information 
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can be analyzed over time discontinuously or continuously. 
The digitized data can be processed to produce images, 
called frames, of the pattern of uptake of the radio- 
labelled ABM in the target organ at a discrete point in 

5 time. In most continuous (dynamic) studies, quantitative 
data is obtained by observing changes in distributions., of 
radioactive decay in target organs over time. In other 
words, a time-activity analysis of the data will illustrate 
uptake through, clearance of the radio-labelled binding 

10 protein by the target organs with time. 

Various factors should be taken into consideration, in 
selecting an appropriate radioisotope. The radioisotope 
must be selected wi-th a view to obtaining good quality 
resolution upon imaging, should be safe for diagnostic use 

15 in humans and animals, and should preferably have a short 

physical half-life-so as to decrease the amount of radiation 
received by the body. The radioisotope used should 
preferably be pharmacologically inert, and, in the 
quantities administered, should not have any substantial 

20 physiological effect. 

The ABM may be radio-labelled with different isotopes 
of iodine, for example ^"l, "^I. or ^^^I- (see for example, 
U.S. Patent 4,609,725). The extent of radio-labeling must, 
however be monitored, since it will affect the calculations 

25 made based on the imaging results (i.e. a diiodinated ABM 
will result in twice the radiation count of a similar 
monoiodinated ABM over the same time frame) . 

In applications to human subjects, it may be desirable 
to use radioisotopes other than "^I for labelling in order to 

30 decrease the total dosimetry exposure of the human body and 
to optimize the detectability of the labelled molecule 
(though this radioisotope can be used if circumstances 
require) . Ready availability for clinical use is also a 
factor. Accordingly, for human applications, preferred 

35 radio-labels are for example, ^^Tc, ^^Ga, ^«Ga, ^°Y, ^^^In, 
ii3mij^^ 1231^ "«Re, "^Re or ^iiAt. 

The radio-labelled ABM may be prepared by various 
methods. These include radio-halogenation by the chloramine 
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- T method or the lactoperoxidase method and subsequent 
purification by HPLC (high pressure liquid chromatography) , 
for example as described by J. Gutkowska et al in 
"Endocrinology and Metabolism Clinics of America: (1987) ii 
(1):183. Other known methods of radio-labelling can be 
used, such as IODOBEADS«. • 

There are a number .of different methods of delivering 
the. radio-labelled ABM to the end-user. It may be 
administered by any means that enables the active agent to 
reach the agent's .site of action in the body of a mammal. 
Because proteins are subject to being digested when 
administ-ered orally, parenteral administration, i.e., 
intravenous, subcutaneous, intramuscular, would ordinarily 
be. used to optimize absorption of an ABM, such as an 
antibody, which is a protein. 
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EXAMPLES 
Example 1 

Differential expression of Genes i n Kidneys of 
Streptozotocin-Treated Mice as a Resu lt of GHR/BP knockout 

To identify genes potentially involved in the 
protection of kidneys during diabetes, we induced diabetes 
by STZ treatment of GHR/BP -/- (knockout) mice (prepared as 
described by Zhou, et al., 1997, supra ^ and nontransgenic 
(+/+) laice* These mice had a- 129 Ola/BalbC genetic 
background, A PCR-based strategy was used to generate cDNA 
subtraction libraries from RNA isolated from the kidneys of 
diabetic and +/+ mice after ten weeks of hyperglycemia. 

- To identify genes whose increased expression potentially 
results in protection against glomerulosclerosis, the cDRAs 
expressed in +/+ kidneys were subtracted from the cDNAs 
expressed in kidneys, resulting in a library (-/-) • 

consisting of cDNAs upregulated in the protected kidney. 

To identify genes whose decreased expression potentially 
results in protection against glomerulosclerosis, the cDNAs 
expressed in kidneys were subtracted from the cDNAs 

expressed in +/+ kidneys, resulting in a library (+/+) 
consisting of cDNAs down-regulated in the protected 
kidney. cDNA clones were spotted on duplicate membranes and 
screened for differential expression using the -/- and +/+ 
subtraction libraries as probes. Differential expression o-f 
clones displaying differential hybridization to the and 
+/+ library probes was confirmed by northern analysis of RNA 
from diabetic and +/+ kidneys using the clones as 

probes- Nucleotide sequences were determined for each clone 
and compared by BLAST analysis to the sequences in the 
public domain databases. 

We have obtained several partial cDNA clones whose 
altered expression is associated with the protection of the 
kidney from damage. 

Induction of diabetes with streptozotocin 

Two to three month old female mice were made diabetic 
(DB) with daily intraperitoneal injections of streptozotocin 
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(STZ; 80-85 ]ag/g body weight, dissolved in 0.1 M citrate 
buffer at a concentration of 8-8.5 pg/ml) until blood 
glucose levels exceeded 200 mg/dL or for a maximum of 8 
injections. Nondiabetic (ND) controls received daily 
5 injections of citrate buffer. After 4 or 10 weeks of 

diabetes {defined as maintenance of a blood glucose level of 
at least 300 mg/dL) , the mice were sacrificed and the right 
kidney quickly removed, weighed, and frozen in liquid 
nitrogen or immediately processed for extraction of RNA. . 
10 The left kidney was perfused through the dorsal aorta with 
phosphate buffered saline followed by 4% paraformaldehyde 
(PFA) in 0.15 NaCi, then removed, cut in half longitudinally 
and stored in 4% PFA for histological analysis. 

15 RNA isolation 

Total RUfi was isolated from kidneys of diabetic female 
GHR/BP +/+ and mice after 4 or 10 weeks of diabetes 

induced by streptozotocin using the RNA STAT-60 Total 
RNA/mRNA Isolation Reagent according to the manufacturer's 

20 instructions (Tel-Test, Friendswood, TX) - 

cDNA synthesis 

Prior to cDNA synthesis, a portion (50 \ig) of RNA was 
further purified to remove small RNAs using the RNeasy Mini 

25 protocol for RNA clean up as instructed by the manufacturer 
(Qiagen Inc., Santa Clarita, CA) . cDNA then was synthesized 
usin^ 1 )ig of this purified sample of total RNA from one 
diabetic GHR/BP +/+ and one diabetic GHR/BP -/- mouse using, 
the SMART PGR cDNA Synthesis Kit according to the 

30 manufacturer's instructions (CLONTECH, Palo Alto, CA) . 

Generation of cDNA Subtraction libraries 

Forward- and reverse-subtracted cDNA libraries were 
generated using the PCR-Select cDNA Subtraction Kit 
35 (CLONTECH, Palo Alto, CA) and the +/+ and -/- samples- One 
library (+/+ or G) included clones down-regulated in the 
diabetic -/- mice and the other library {-/- or H) included 
clones up-regulated in diabetic -/- mice. 
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Isolation of Individual clones 

After generating the cDNA subtraction libraries, the 
PGR product ends were made blunt by treatment with Pfu DNA 
polymerase (Stratagene, La Jolla, CA) and subcloned Into a 
bacterial plasmid vector using the Zero Blunt TOPO PGR 
Cloning Kit as instructed by the manufacturer (Invitrogen 
Corp., Carlsbad, CA) . Individual clones were obtained by 
plating on selective media. 

Screening by differential hybridization 

cDNA arrays of clones from the forward and reverse 
subttacted libraries were screened with prober made from 
each library using the PCR-Select Differential Screening Kit 
according to the manufacturer's instructions (CLONTECH> Palo 
Alto, CA) . 

Nucleotide sequence deteirmination 

Plasmid DNA from bacterial colonies carrying the 
differentially expressed cDNA inserts was isolated using the 
QIAprep Spin Miniprep Kit according to the manufacturer's 
instructions (Qiagen Inc., Santa Clarita, CA) . Nucleotide 
sequences were determined by use of the ABI PRISM BigDye 
Terminator Cycle Sequencing Ready Reaction Kit with 
electrophoresis on the ABI PRISM 377 DNA Sequencer (PE 
Applied Biosys terns, Foster City, CA) . 

Database Searches 

Nucleotide sequences and predicted ami.no acid sequences 
were compared to public domain databases using the Blast 2.0 
program (National Center for Biotechnology Information, . 
National Institutes of Health) . Nucleotide sequences were 
displayed using ABI prism Edit View 1.0.1 (PE Applied 
Biosysteras, Foster City, CA) . 

Nucleotide database searches were conducted with the 
then current version of BLASTN 2.0.12, see Altschul, et al., 
""Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs". Nucleic Acids Res., 25:338 9-3402 
(1997) . Searches employed the default parameters, unless 
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Otherwise stated. 

For blastN searches, the default was the blastN matrix 
(1,-3)/ with gap penalties of 5 for existence and 2 for 
extension. 

5 Protein database searches were conducted with the then- 

current version of BLAST X, see Altschul et al. (1997), 
su pra . Searches employed the default parameters, unless 
otherwise stated. The scoring matrix was BLOSUM62, with gap 
costs of 11 for existence and 1 for extension. The standard 

10 low complexity filter was used. 

"'ref indicates that NCBI's RefSeq is the source 
database. The identifier that follows is a RefSeq accession 
number/ not a GenBank accession number. ^^RefSeq sequences 
are. derived from GenBank and provide non-redundant curated 

15 data representing our current knowledge of known genes. Some 
records include additional sequence information that was 
never submitted to an archival database but is available in 
the literature. A small number of sequences are provided 
through collaboration; the underlying primary sequence data 

20 is available in GenBank, but may not be available in any one 
GenBank record. RefSeq sequences are not submitted primary 
sequences. RefSeq records are owned by NCBI and therefore 
can be updated as needed to maintain current annotation or 
to incorporate additional sequence information.'' See also 

25 http : //www . ncbi ■ nlm. nih . gov/LocusLink/ref seq, html 

It will be appreciated by those in the art that the 
exact results of a database search will change from- day to 
day, as new sequences are added. Also, if you query with a 
longer version of the original sequence, the results will 

30, change. The results given here were obtained at one time 
and no guarantee is made that the exact same hits would be 
obtained in a search on the filing date. However, if an 
alignment between a particular query sequence and a 
particular database sequence is discussed, that alignment 

35 • should not change (if the parameters and sequences remain 
unchanged) . 

Northern analysis 
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Positive clones r identified by the differential 
hybridization screen, were used as probes in Northern 
hybridization analyses to confirm their differential 
expression. Total RNA isolated from the kidneys of diabetic 

. 5 • female +/+ and mice was resolved by agarose gel 

electrophoresis through a 1% agarose, 1% formaldehyde 
denaturing gel, transferred to positively charged nylon 
membrane, hybridized in DIG Easy Hyb with the cDNA insert 
labeledvby asymmetric PGR with Digoxigenin, and analyzed 

10 using anti-Digoxigenin-alkaline phosphatase conjugate and 

CDP-Star according to the manufacturer's instructions (Roche 
Diagnostics Corp., Indianapolis, IN). 

CfiARACTERXZATION OF CLONES 
15 Clone 626 

Insert* size: 1527 bp query sequence (with gap) (SEQ ID 
NOs:l and 2) 

Nucleotide database search 
Blast N: 

20 Clone G26 is apparently a partial cDNA of a gene 

encoding a protein similar to Mus musculus kidney 3 beta-^ 
hydroxysteroid dehydo.genase type 4 {Hsd3b4) . The highest 
scoring DNA alignment (8 90 bits, E valu3 0.0) was of bases 
3-560 of clone G26 with bases 1602-1048 of Hsd3b4 . (Note 

25 that since the orientation of the. cDNA inserts in the 
cloning vector was not known, ^^plus'' was assigned 
arbitrarily for the purpose of thfe Blast alignment. So, 
since -the match is to the minus strand of a known DNA, we 
assume that the strand labeled ^^plus" was actually the minus 

30 strand of G26.) The percentage identity was 96% (540/561), 
with 9 gaps. 

There were four equally high scoring matches: Genbank 
BC01344.9.1, ref NM_008294.1, dbj AB049424.1 and Genbank 
L16919.1. The first is an M. musculus cDNA ^^similar to 
35 hydroxysteroid dehydr6genase-4 delta <5>-3-beta'' . The . 

second is M. musculus hydroxysteroid dehydogenase-4, delta 
<5>-3-beta. The third is M. musculus clone: 123-1, derived 
from 4x chromosome of T(X;4) 37H translocation. The. fourth 
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is the M. musculus cDNA (pK411A) 3-beta-hydroxystero±d 
dehydogenase/delta-5-delta-4 isomerase (Hsd3b4) . The other 
Tiigh scoring DNA alignments were with genes closely related 
to Hsd3b4. 

The highest hurtian matches in the main database were 
ail4504506lref INM 000862.11 Homo sapiens hydroxy~delta-5-st . .. . '7_6- 7e-ll 
qil9581790leinblAL121995 . 121HSDJ92QG3 Human DNA sequence fro... 2£ 7e-ll 
qil257052lqblS45679.1IS45679 3 beta-hydroxysteroid dehydrog. . . 2£ 7e~ll 
ail287843ieinblX53321.1lHS3BHD54i H. sapiens inRNA for 3Beta-h. , . 76 7e-ll . 
• ai|23861ieinblX55997.1lHS3B5H5E Human gene f or * 3-beta-5-hydr . . *• 76 7e-ll 
aill79467iqblM38180.HHOMBHSD Human 3-beta-hydroxysteroid d. . . 76 7e-ll 
aill77194lablM63397.1lHUM3BHSD03 Human 3-beta-hydroxysteroi . . . 16 7e-ll 
qHl77190lqblM35493.1lHUM3BHSD Human placental 3 -beta-hydro . 76 7e-ll 
aill84398lablM27137,llHUMHSDB Human 3 beta-hydroxy-5-ene st. . . .76 7e-ll 
Qlll84275iab iM28392,llHUMHODB3 Human 3-beta-hydroxysteroid. . . 16 '76-11 

However, in the human EST library, the top scores were 
aillS78840lablAA071460.1lAA071460 zm73f03.sl Stratagene neu. . . 868 0.0 
ai|1688820lablAA129037 . 1IAA129037 zolLel2.sl Stratagene neu... 819 0.0 
aill578927labiAA071539. 1IAA071539 zm7 4b0 6 . sl * Stratagene neu... 791 0.0 
Qill578624lablAA071Q72. 1IAA071072 zm73fo3.rl Stratagene neu... 660 .0.0 

Thus, of the h\Haan sequences ^ Clone G26 is most similar 
to a human EST (gi: 1578840), not to any of the human cDNAs 
reported in GenBank (see included alignments) . This EST is 
conceivably the cognate hxaman gene. However, there is a 
warning in the database to the effect that ^^there is 
evidence that suggests that the 38.4-well parent plate of 
this clone contains both human and mouse derived clones. 
Thus, the origin of this clone is uncertain.'' The same 
warning appears for gi:1688820, gi:1578927, and gi:1578624. 

Protein database search 

Blast X: The best score in the main database was with 
Mus musculus -hvdroxysteroid dehydrogenase-4 , delta <5>-r3-r 
beta, ref NP_032320.1 (score 187 bits, expect 2e~71, 95% 
identity) . 

In the human database, the best scores were for 
Qii45Q4507lreflNP QQ0853.ll (NM_000862) hydroxy-delta-5-ste. . . 123, 9e-48 
aHl79468lqblAAA51831,ll {M38180) 3-beta-hydroxysteroid deh. . . 12S. 9e-48 
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ail4504509irefiNP Q00189>ll (NM_000198) hydroxy- de 1 ta-5-ste, . . 122 2e-46 

aiil2043436iemblCAC19801.1l (AL359553) dJ871G17.4 (novel 3-... 122 9e-45 

Qill7488989lrefiXP 060821.11 (XM_060821) similar to dJ871Gl- . . 122 9e-45 

Qill74890Q3lreflXP 060827.11 {XM_060827) similar to 3 BETA-... 122. 3e-44 

aill9979635lQbiAAM08704 .11 (AF252254) 3-beta-hydroxysteroid. . . 122 2e-39 

oil3a6780lablAAA36001 . 11 (M28392) 3-beta-hydroxysteroid deh. . . 128 2e^39 

Note that these proteins are functionally related. 

Northern analysis: Down-regulated in protected (DB -/-) 
kidney 

ND +/+>DB+/+=ND-/-»DB-/-. 

The .characteristics of Hsd3b4 are discussed in Clarke ^ 
et ai., Mol. Endocrinol., 7:1569-78 (1993). Its 
relationship to other types of beta-hydroxysteroid 
dehydrogenases is discussed in Payne, et al., J. Steroid 
Biochem. Mol. Biol., 53:111-8 (1995). 

Clone H8 

Insert size: 875 bp query sequence 875 bp (SEQ ID NO: 2) 
Nucleotide database search 

Blast N: The best mouse match was to gp bF53964 4.1 EST 
1197 bits e=0.0 97%. In the main database, the best match 
was to Genbank AC023194.8, H. Sapiens clone RP11-216N21, E 
value of 8e-40,' score of 163 bits. Query 107-280 was 
aligned to subject 139697-139870, with 151/174 (86%) 
identities. There were several other alignments, too. 

The score for the match to Homo sapiens ATPase, . . 

XM_043092.1 was 40 bits, with E=O.O08. 

Looking at human ESTs, the top matches were 

ai|15755092lablBI763514.1lBI763514 603050239F1 NIH_MGC_116 . . • HI e-174 

aill329317QlQblBG399722.1lBG399722 602441316F1 NIH_MGC_75 H. . . 414 e-113 

ail5339253iablAI791537lAl7 91537 oj41bll,y5 NCI_CGAP_Kid3 ... 408 e-111 

Qill3340125lablBG433708 . 11BG433708 602497822F1 NIH_MCG_75 H. . . 363 7e-98 

Qill21585981ablBF82Q247. 1IBF820247 CM0-RT0018-181100-701-cl . • . 35^9 le-96 

all5054161lablAI733Q48.1lAl733048 oj41bll.x5 NCI_CGAP_Kid3 ... 33^ 6e-89 

Qill5751872lablBI7660294.1lBI760294 603045361F1 NIH_MGC_116. . . SOX 2e-79 



Blast X: The top scoring sequences are: 
gi|18570892|ref|XP_088368.1|Hypothetical protein XP_088368 
oil73Q4909lref INP 038505.11 (NM_013477) ATPase, H+ transpor. 



379 
370 



e-104 
e-102 
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aill2643287lsplP51863lV AQD MOOSE Vacuolar ATP synthase subu. . . 370 e-102 

aiil4776195lreflXP_Q43091.1l ATPase/ H+ transporting, lysos... 370 e-102 

Qill5029719lcrolAAHll 075 . 1IAAH11075 {BC011075) Unknown {prot.., 231 e-101 

Qi!l8543319iref INP 570080.11 {NM_130724) CG2934 gene produc. . . 354 3e-97 

qH12585456IspIO25531IWA0D MANSE Vacuolar ATP synthase s\ibu. . . 352 4e-97 

Qill6259229lablAAK85455.2lAC006611. 2 (AC006611) Hypothetica. . . 337 2e-92 

aill7505761lreflNP 491515. ll {NM_059114) vaculolar ATPase [..• 223. 4e-87 

XP_088368.1 is a hypothetical protein; region 15.. 214 
has -been identified as a "'ATP synthase (C/AC39) subunit. 
Sequence xP_088368 . 1 * was replaced on May 7, 2002 with 
gi:2048 5329; . ""similar to Ac39/physophilin (Homo sapiens)" 

XP_O4309.1 is ATPase, H+ transporting, lysosomal 
(vacuolar protein pump), H. sapiens. It is also known as 
"^similar to vacuolar ATP synthase subunit D''. Between query 
3-818 and^ subject 9-279/ there was 65% identity (179/272), 
with a score of 370 bits, E=e-102. 

Best mouse match ref NP_038505.1 370 bits e-102 65% 
vacuolar adenosine tri-phosphatase subunit D. 



Northern analysis: Up-r^gulated in protected (DB-/-) 
kidney 

DB > > 

ND -/- > = 
ND +/+ = 
DB +/+ 

Clone Hi 

Insert size: 966 bp, query sequence 966 bp (SEQ ID 

NO: 4) 

Nucleotide database search 

Blast N: Matches with GenBank U96635 Mus musculus 
ubiquitin protein ligase Nedd-4 mRNA (1780 bits, E 0.0), 
when bases 1-966 of HI are aligned with bases 3775-4739 of 
the database sequence. The percentage identity is 98% 
(957/969), with 7 gaps. 
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There is an even higher (1804) scoring alignment with 
BC007184^ musculus clone IMAGE : 3591422 . 

The highest scoring human sequence is ref XM_04 6129.1f 
Homo sapiens neural precursor cell expressed, 
developmentally down-regulated 4 (NEDD 4), with a score of 
105 bits and E=4e-20. 

There are identical scoring human ESTs, e.g., BG035335 
and BF357 965. 

Protein database search 

Blast X: Nothing significant.. This is expected; the 
coding sequence of U 96635 is at bases 612-2945, so the 
clone is in 3' UTR of U96635.' 

Northern analysis: Up-regulated in protected (DB -/-) 
kidney 

DB-/-»DB+/+=ND+/+=ND-/- . 

As described by Kumar, et al . , Genomics, 40:435-446 
(1997), Nedd4 encodes a protein which is strongly expressed 
during early CNS development. It contains a calcium/phospho 
lipid binding domain, three protein-protein interaction 
domains (the WW domains) , and a ubiquitin-protein ligase 
domains. Ubiquitin labels proteins desired for degradation 
by substrate-specific ubiquitin conjugating enzymes and 
ubiquitin-protein ligases . 

The alignment of mouse, human (Genbank D42055) and rat 
(Genbank U50842) DNA sequences is set forth in Kumar, Fig. 

> 

2. 

Pirozzi, et al., J. Biol. Chem., 272:14611-6 (1987) has 
identified other hxaman WW domain-containing proteins; these 
are aligned with, inter alia > Nedd-4 in Fig. 2. 

Clone Fl 

Insert size: 1159 bp, query sequence 1159 bp (SEQ ID 

N0:5) 

Nucleotide database search 

Blast N: Matches with ref NM 019973.1 Mus musculus Son 
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cell proliferation protein mRNA (1931 bits, E=0.0, 99% 
identity between query 150-1159 and subject 413-1419) , 
GenBank AFl 93607 Mus musculus truncated SON protein (Son) 
inRNA (1931 bits, E value=0.0/ 99% identity between query 
150-1159 and subject 413-1419) and with Genbank AF193597 Mus 
musculus SON protein mRNA (1931 bits, E value 0.0, 99% 
identity between query 150-1159 and subject 156-1162) . 

The best human match was to ref NM_058183.1 Homo 
sapiens SON DNA binding protein mRNA, 922 bits, E=0.0, 86% 
identity of query 150-1159 to subject 414-1435. 

Protein database search 

Blast X: Matches with ref NP_064357 and AAF23121.1 
(AF193607) Mus musculus truncated SON protein (624 bits, 
Er=e-178, 86% identity of query 3-1157 with subject 80-463) 
and GenBank AAF23120.1 (AF193606) ^Mus-musculus Son protein 
(624 bits, E=e-178, 86% identity over 385 a. a.). 

Please see also ref NP_003094.2, the H. sapiens SON 
DNA-binding protein, KIAA1019 (score 497 bits, E=e-140, 
align query 3-1157 to subject 80-468 for 278/390 (71%) 
identities, 5/390 gaps) . 

Northern analysis: up-regulated in protected (DB -/-) 
kidney DB-/-»ND-/-=ND+/+»DB+/+ . 

The SON protein is discussed in Wynn, et al . , 
""Organization and Conservation of the GART-S0N-3SG locus in 
mouse and human genomes''. Genomics, 68 (l):57-62 (2000); its 
gene is within the Down Syndrome Critical Region. 

Of the human sequences, clone Fl is most similar to the 
SON DNA binding protein (gi : 15055517 ) with 71% identity and 
e-140. 

Clone 638 

1050 bp insert, 842 bp query sequence (SEQ ID NO: 18) 

Highest alignment score was with Genbank AF178454.1, Mus 
musculus anti-human apolipoprotein A monoclonal antibody 20 
L Kappa light chain, 

Identities=538/565 (95%), score 888 bits, E=0.O. 
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Highest alignment score with a Human mRNA was with Genbank 
S65921.1, anti-colorectal carcinoma light chain glycoprotein 
CANAG-50 specific IgGl Kappa 

Iclentities=358/387 (92%), score 529, e-148 • 

At the protein level, the highest score was with gi 
1310829, pdb IKNO, Igg2a -p-nitrophenyl methyl phosphonate. 
The highest human score was with gb AAB28160.1 (865921) an 
anti-colorectal carcinoma antibody, score 258, E=3e.-71, See 
•Zemel, et al., Molec. Immunol., 31: 127-37 (1994) . 

Northern analysis : up-regulated in protected (DB-/-) 
kidney. 



In like manner, we have identified several additional 
clones, with the following Northern analyses (ND 
nondiabetic/no STZ; DB diabetic/STL; NT nontransgenic; HO- 
transgenic for GHR/BP Knock out) : 

G9: NDNT<DBNT=DBHO=NDHO (SEQ ID,NO:29) 
Cytochrome oxidase III 

G16: NDNT=DBNT<DBHO<NDHO (SEQ ID NO: 30) 
Hypothetical protein 

G24: DBNT>NDNT=DBHO=NDHO (SEQ ID NO: 31) 
Homolog to ERG2 protein 

628: NDNT=DBNT>DBHO=NDHO (SEQ ID NO: 32) 
Glutathione peroxidase 3 

Example 2 

Differential Expression of Genes in Kidneys of Nontransaenic 
Mice as a result of Stre otozotocin Treatment 

End stage renal disease (ESRD) is a costly outcome of 
diabetes, yet it is a difficult complication to prevent. 
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Factors that influence its development include heredity, 
blood glucose levels, and blood pressure, ESRD is a 
progressive disease that can take many years to develop. It 
can sometimes be treated, and even reversed, if caught at an 
early stage. Thus, an understanding of the molecular basis 
for the development and progression of diabetic nephropathy 
culminating at ESRD is crucial for the diagnosis, treatment 
and prevention of this debilitating disease. 

'Streptozotocin-induced diabetes in mice provides an 
excellent model system for studying the development of 
insulin-dependent diabetes mellitus (IDDM) nephropathy / at . 
both the physiological and molecular level. We, and others, 
have used this model system to examine the role of specific 
genes or gene products in the development of diabetic 
nephropathy leading to glomerulosclerosis. While 
significant advances have been made, much information is 
still lacking. In an attempt to further our knowledge of 
the molecular mechanisms resulting in diabetic nephropathy, 
we are using a polymerase chain reaction (PCR) -based cDNA 
subtraction method to compare populations of mRNAs expressed 
in kidneys of streptozotocin-induced diabetic mice at 
various developmental stages of the disease with populations 
of mRNAs expressed in healthy kidneys by creating libraries 
containing virtually all of the gene products (cDNAs; novel 
and known) involved in the progression of 
glomerulosclerosis. Nephropathy-correlated genes, thus 
.identified, may be used as molecular markers for disease 
progression and as therapeutic drugs or drug targets. 

In this example, subtractive hybridization was used to 
identify genes differentially expressed in diabetic (STZ- 
treated) and control mice. These mice, unlike those of 
Example 1, were all nontransgenic. 

Several partial cDNA clones were identified whose 
altered expression is associated with kidney damage. The 
protocol was the same as for Example 1, except as set forth 
below. 



RNA isolation 
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Total RNA was isolated from kidneys of wildtype female 
mice after 4 or 10 weeks of diabetes induced by 
streptozotocin/ as well as similarly aged non-diabetic 
wildtype controls, .using the RNA STAT-60 Total RNA/mRNA 
Isolation Reagent . according to the manufacturer's 
instructions (Tel-Test, Friendswood, TX) . 

.cDNA subtraction libraries 

One library (E) included clones up-regulated in 
diabetic mice and the other library (F) included clones 
down-regulated in diabetic mice. 

Northern analysis 

Positive clones, identified by the differential 
hybridization screen, were used as probes in Northern 
hybridization analyses to confiirm their differential 
expression. Total RNA isolated from the kidneys of female 
mice after 10 weeks of diabetes induced by streptozotocin, 
as well as non-diabetic controls, was resolved by agarose 
gel electrophoresis through a 1% agarose, 1% formaldehyde 
denaturing gel, transferred to positively charged nylon 
membrane, hybridized in DIG Easy Hyb with the cDNA insert 
labeled by asymmetric PGR with Digoxigenin, and analyzed 
using anti-Digoxigenin-alkaline phosphatase conjugate and 
CDP-Star according to the manufacturer's instructions (Roche 
Diagnostics Corp., Indianapolis, IN). 

CHARACTERIZATION OP CLONES * 
Clone Fl 

The results of a sequence search for this clone are 
discussed in Example 1. Besides being up-r^gulated in 
protected (DB -/-) kidney, it was down-regulated in diabetic 
+/+ kidney 

ND+/+»DB+/+ 

Clone F4 

Insert size: 616 bp query sequence 616 bp (SEQ ID 

NO: 6) 
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Nucleotide database search 

Blast N: Matches with ref. NM^010594.1 Mus musculus 
kidney androgen regulated protein (Kap) inRNA 

E Value=0.0 (1154 bits) (99% identity of query 9- 
612 with subject 3-606) 

With H. sapiens FKSG22 itiRNA (AF319957 . 1) , the alignment 
yielded, a score of 1098 bits (E=0.0) for query 33-612 with 
sxibject 1-580 (576/581 [99%] identities). 

Protein database search 

Blast X: Matches with ref. NP_034724.1 (NM 010594) Mus 
musculus kidney androgen regulated protein and human 
AAH08576.1 kidney androgen regulated protein. 

E value«le-61 (235 bits) (99% identity of query 
54-416 with subject 1-121) 

Northern analysis: Down-regulated in diabetic kidney 
ND+/+»>DB+/+ 

Of the human sequence^ clone F4 (and F6) is most similar to 
FKSG22 mRNA (gi : 12276125) , implicated in ovarian cancer, and 
kidney androgen regulated protein mRNA (gi : 14250295) ; e=0. 
Both mRNAs encode the protein, KAP. 

The kidney androgen-regulated protein (KAP) is encoded 
by a relatively abundant mRNA (4% of poly (A) RNA in male 
mouse kidney) . The concentration of KAP mRNA is 3-5 fold 
higher in males, or in testosterone-treated females, than in 
untreated females. It appears to be expressed in the 
epithelial cells of the proximal convoluted tubules, in 
particular by cells of the SI, S2 and S3 segments. Several 
possible hormone-responsive elements have been identified in 
the 5 '-flanking region. The KAP gene is a single copy gene, 
but seems to be highly polymorphic. See generally Niu, et 
al., DNA & Cell Biol., 10:41-8 (1991). 

For KAP, see also Homo sapiens, kidney androgen 
regulated protein, clone MCe:18182, IMAGE : 4155347, mRNA, 
complete cds . Accession #BC008576, version #BC008576.1, GI : 
14250295. 
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Clone F5 (SEQ ID NO: 7) 

Insert size: -1000 bp, query sequence 881 bp, SEQ ID 

NO: 6 

Nucleotide database search 

Blast N: Top matches are with (1) gi20881938, ref 
XM_12787 6.1 ^^Mus musculus similar to data source: SPTR, 
source key:Q9Z0S6, evidence : ISS~putative~similar to CLAUDIN- 
10" (score 1233 bits E 0.0), (2) gi 21617848, ref 
NM_023878.1, "Mus musculus claudin 10 (CldnlO) (score 125 
bits E 0.0)", and : (3) . gil2860620, dbj AK020131 "^Mus. musculus 
:12 days embryo male wolffian duct includes surrounding 
region cDNA> RIKEN full-length enriched library", (1215 bits 
E 0.0), and several other claudin-lO-related sequences (gb 
•BC021770.1, dbj AK055855.1, BC029019.1, ref NM_021386.1, gb 
AFli24425.1) , 

~ The highest-ranked hianan sequence was gi 16550687, dbj 
AK055855.1, ^'Homo sapiens cDNA FLJ31293 fis, clone 
KIDNE2007569, moderately similar to claudin 10." See also 
gi 16444694, emb AL357061.19, and gi 17475396, ref 
XM_007076.4. 

The cognate human claudin-10 sequence is ref. 
NM_006984.1 (165 bits, E=le-37) or Genbank U89916. See also 
gi 17475396, ref XM_007076.4, and gil5012044, Genbank 
BC0109.20. 

Protein Database search 

Blast X: Best matches were with (1) gi 20881939, ref 
XP_127876.1, '"'similar to data source: SPTR, source key: 
Q9Z0S6, evidence :ISS~putative~similar to CLAUDIN-10 [Mus 
musculus]" (236 bits, E=le-74); (2") gi 21617849, ref 
NP_07 6367 .1, "claudin 10" (234 bits E=3e-74), (3) gi 
16550688, dbj BAB71030.1, AK055855, "unnamed protein product 
[Homo sapiens]" (225 bits, E=4e-7l); and (4) gi 18256029, gb 
A7^21770.1, "similar to RIKEN cDNA 6720456116 gene [Mus 
mussculus]" (236 bits, E=le-68) .' 

The cognate human protein sequence is ref NP_008915.1 (E=6e- 
40) . 
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Northern analysis: Down-regulated in diabetic kidney 
ND+/+>DB+/+ 

Claudins are integral membrane proteins localized as 
tight junctions f. see Kubota, et al., Curr. Biol. 9:1035-8 
(1999); Monta, et al . , J- Cell. Biol., 145:579-88 (1999); 
Monta, et al., Proc. Nat. Acad. Sci. (USA), 96:511-16 
(1999) . 

Clone F6 (SEQ ID NO: 8) 

This is a slightly shorter version (611 bp) of clone 
F4.. • • 

Clone E39 

Insert size 981 bp (SEQ ID N0s:15 and 16) 

.Blast N Similar to gb U65535.1 mouse immunoglobulin 

kappa-chain mRNA, 34 6 bits 2e-92 

The best human match was to gbS65921.1/ anti-colorectal 
carcinoma light chain=glycoprotein CANAG-50 specific. IgGl 
kappa 338 bits, 4e-90; See Schable, et al., Eur. J. 
Immunol. r 29:2082-6 (1999). 

Blast X 

Human protein cognate: 1FH5L, pdb 1FH5 .Chain L crystal 
structure of the Fab Fragment of the Monoclonal Antibody Mak 
33, 93 bits 5e-19. 

See also Mouse gb AAC04542.1, monoclonal antibody kappa 
light chain [mus musculus] 61 bits, . 8e-15 . 

Northern analysis: Up-regulated in diabetic kidney 
(DB+/+»ND+/+) and damaged bGH kidney (bGH>NT) . 

Clone F27 

-1000 bp insett 820 bp query sequence (SEQ ID NO: 17) 
End 337 nt similar to ref NM 013559.1 mus musculus heat 
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shock protein, 105 JcDa (Hsp 105) 
Blast N mRNA, 3387 nt 

Identities=329/338 (97%), E=e-173, 607 bits 
•ref XM_036358.1 Homo sapiens heat shock 105 kD (HSP105B) 
5 itiRNA, 220 bits, 7e-55, 83%. 

dbj BAA11035.1 (D67016) heat shock protein 105KD alpha [mus 
musculus] , 182 bits, 3e-45, 75%, 

Human protein cognate: 
10 Blast X. Heat shock 105kD [homo . sapiens] ref XP_036357.1 
Identities=95/136 (69%), le-42 
Rest of sequence is novel 

Northern analysis: Down-regulated in diabetic kidney 
(ND+/+>DB+/+) 

15 

The top matches in the nucleic acid alignment were 
ail7305168lref IN^013559, 11 Mus musculus heat shock protein... 607 e-173 
. aiJ84065HablL40406.1lMUSHSPE Mus musculus heat shock prote. . . 607 e-173 
20 aill001008ldbilD67016.1lD67016 Mus wagn^ri mRNA for heat sh. . . 6tf7 e-173 

aill001010ldbild67017.1ld67017 Mus wagneri mRNA for heat sh. . . 599 e-171 

For the top match^ NM_.013559, Mus. musculus heat shock 
protein 105 kDa, query 1-337 aligned to subject 894-1231 
25 with 329/338 (97%)' identities and 1 gap. 

The best NA alignment to a human sequence wats of query 
1-328 to XM_036358.1/ Homo sapiens heat shock 105 kD, 
• subject 797-1125, with 274/329 identities (83%) and 1 gap. 
For the amino acid sequence corresponding to 
30 NM_013559.1, that is, the Mus musculus 105 kDa heat shock 
protein. See Morozov, et al . , FEES Lett., 371 ( 3 ): 214-18 
(1995), and has the ID of NP_038587.1 (which refers to it as 
a ^^110 kDa'' protein) . 

35 In like manner, additional clones have been identified, 

with the following Northern analyses (ND, nondiabetic; DB, 
diabetic) : 
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P2: ND>DB; Far upstream element (FUSE) binding protein 1 

(SEQ ID NO:23) 
F16: ND>DB; Unknown (SEQ ID NO: 34) 
P21: ND<DB; Unknown (SEQ ID NOS:24r 25) 

P38.: ND>DB; Palmitylated serine/threonine kinase (SEQ ID 
NO:26) 

P39: ND>DB; Phosphotriesterase related protein (SEQ ID NO: 27) 
F40: ND<DB; Similar to mouse testicular tumor differentially 
expressed gene (TDE) 1 (SEQ ID NO:28) 

FUSE binding . protein 1 is discussed in Duncan, et al.. 
Genes & Development, 8:465-80 (1994) • The c-myc proto- 
oncogene stimulated by it is involved in normal cell 
proliferation and programmed cell death. The FUSE is 
required for maximal transcription of c-myc. FBP stimulates 
expression of c-myc in a FUSE-dependent manner and is 
believed to be involved in regulation of c-myc. 

A 305-aa embryo-derived serine/threonine kinase • 
(EDPK) , which we have identified with clone F38/ was 
isolated by Kurioka, et al., Biochim. Biophys, Acta, 1443: 
275-84 (1998) . It is believed to play a role in 
intracellular signaling during embryogenesis . Its catalytic 
domain is 271 a. a. Another serine/threonine kinase, Krct, is 
described in Stairs, et al.. Human Molec. Genet., 7: 2157-66 
(1998) . 

A mouse cDNA (mpr56-l) with .homology to a prokaryotic 
parathion hydrolase (phosphotriesterase) -encoding gene is 
discussed in Hou, et al . , Gene, 168: 157-63 (1996). This 
gene is expressed primarily in liver, and post-natally . .It 
is underexpressed in cystic kidneys. Expression is 
decreased upon acute* renal injury induced by a single 
intraperitoneal injection of folic acid. The protein is 349 
a. a. There is greater than 50% a. a. similarity to the 
procaryotic prathion hydrolase. 

The human TDE homologue.is analyzed in Bossolasco, et 
al., Molec. Carcinogenesis, 26: 189-200 (1999). It is 78% 
homologous to the mouse TDE amino acid sequence. 
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Example 3 

Differential Expre ssion of Genes in Kidneys of Mice a. 
Result of Overproduction of Growth Ho rmone 

In this example, genes were identifieci which were 
differentially expressed in (1) transgenic mice with kidney 
damage as a result of overexpression of bovine growth 
hormone, as compared to (2) nontrans.genic control mice. 

A PGR based cDNA subtraction strategy was used to 
create libraries cont;aining genes potentially involved in 
the progressive development of nephropathy. Total RNA was 
isolated from the kidneys , of bGH- and nontransgenic (NT), 
control mice at 2 months, 5 months, and 12 months of age and 
used to create the two cDNA libraries for each time point. . 
The bGH libraries at each time poi.nt consisted of cDNAs 
upregulated in the bGH as compared to NT mice- The NT' 
libraries consisted of cDNA"s down-regulated in. the bGH as 
compared to NT. mice (i.e., upregulated in the NT as compared 
to the bGH mice) . Subsequently, cDNA clones from each 
library were spotted on replicate membranes. Differential 
hybridization of the two membranes, one probed with the bGH 
library and the other probed with the NT library, was used 
to select differentially expressed clones. Dif f erentiail 
hybridization of clones from the two libraries at 2 months 
of age showed no differential expression. At 5 months of 
age a number of differentially expressed clones were 
observed- At 12 months of age an even larger nximber of 
differentially expressed clones were seen. Thus far. 
Northern Analysis has revealed ,that several clones from the 
12 month bGH library are highly expressed in the kidneys of 
bGH mice as compared to NT controls. Furthermore, their 
expression increases with time, correlating with development 
of glomerulosclerosis . 

The protocols were the same as for Example 1, with the 
following exceptions. 

Mice 

The mice had a B6/SJL genetic background. Inserted in 
their genome was the bGH cDNA clone whose expression was 
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driven by the mouse metallothionein I transcriptional 
regulatory sequences. 

RNA isolation 

Total RNA was isolated from kidneys of 2, 5 and 12 
month old female bovine growth hormone transgenic (bGH) mice 
and non-transgenic controls using the RNA STAT-60 Total' 
RNA/mRNA Isolation Reagent according, to the manufacturer's 
instructions (Tel-Test, Friendswood, TX) . . 

cDNA Subtraction Libraries 

The forward (.bGH) subtracted library included clones 
up-regulated in bGH mice and the reverse (NT) library 
included clones down regulated in bGH mice. 

Screening by differential hybridization 

cDNA arrays of clones from the forward and reverse 
subtracted libraries at each time point {2, 5, and 12 months 
of age) were screened with .probes made from each library of 
the corresponding time point using the PCR-Select 
Differential Screening Kit according to the manufacturer's 
instructions (CLONTECH, Palo Alto, CA) . 

Northern Analysis 

Positive clones, identified by the differential 
hybridization screen, were used as probes in Northern 
hybridization analyses to confirm their differential 
expression. Total RNA isolated from the kidneys of 2, 5 and 
■ 12 months of age female bGH and non-transgenic mice was 
resolved by agarose gel electrophoresis through a 1% ' 
agarose, 1% formaldehyde denaturing gel, transferred to 
positively charged nylon membrane, hybridized in DIG Easy 
Hyb with the cDNA insert labeled by asymmetric PGR with 
Digoxigenin, and analyzed using anti-Digoxigenin-alkaline 
phosphatase conjugate and CDP-Star according to the 
manufacturer's instructions (Roche Diagnostics Corp., 
Indianapolis, IN) . 
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CHARACTERIZATION OF CLONES 

Genes up regulated in the 2, 5 or 12 -month bGH 
libraries are considered unfavorable, and those up-regulated 
in the corresponding NT libraries are considered favorable / 
i.e., likely to encode a protective protein. 

Clone A8 (12 month b6H .Ixbrasry) 

•.Insert 334 bp, query sequence 334 bp (SEQ ID NO: 9) 
By BlastN, the highest similarity was to M217 96.1; 
Mouse (M. spretus) Ig germline Kappa chain gene C-region, 
axon, with a score of 640 bits (E==0.0), and 99% identity of 
query 8-334 with subject 1287-1613. Highest similarity to 
human clone was S65921.1 anticolorectal carcinoma light 
chain ^ glycoprotein CANAG-50 specific IgGl kappa (57 9 bits, 
97% identity) e-163. 

By BlastX, there were a very large number of sequences- 
with the same score (86 bits) and E' value (9e-17) . Almost 
all were immunoglobulin genes. .The best matching human 
clone was AAB28160 . 1 (S65921) anti-colorectal carcinoma light 
chain (86 bits, 97% identity 9e-17) . 

Northern analysis: Expression in kidneys of bGH— 
transgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice- 
Highly expressed in spleen, lung, intestine, stomach, 
kidney, and omental fat; very low levels in liver, heart, 
gastrocnemius, and brain of nontransgenic mouse. 

Cloxie A34 (12 month bGH library) 

Insert size: 503 bp, query 503 bp (SEQ ID NO: 10) . 

By BlastN, this has a score of 636 bits (E~0.0) for a' 
99% identical alignment of query 139-463 to subject 1323- 
999, the subject being emb X03690.1, M. musculus mRNA for Ig 
heavy chain constant region mu(b) allele. The best human 
match was to AF144029.1 Homo sapiens MDM2 gene, intron 9 and 
exon 10, partial sequence (208 bits, 2e-51, 99% identity) . 

By blastX, the score is 224 bits (E=3e-58) for a 99% 
identical alignment to pir A24976, Ig mu Chain C region 
(allele b) of the mouse. The best human alignment was to 
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AM01872.1 Unknown (191 bits, 4e-49) , 84% identity) as well 
as many others. 

Northern analysis: Expression in kidneys of bGH- 
tfansgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 

Clone A3 9 (12 month JbGH Ixbrazry) 

Insert Size: 482 bp, query sequence 482 bp 
(SEQ ID NOill) 

. By BlastN the highest score (879 bits, E=0.0) was with • 
BC0.1Q327.1, Mus musculus^ clon^ MGC:6533, •image:2651776, 
435/457 identity (99%) of query 26-482 with subject * 425-881 . 
Next best (782 bits) was a mouse mRNA for •gamma-2b 
immunoglobulin (emb V00799.1). The best human match was 
S 657 61/ anti-colorectal carcinoma heavy chain glycoprotein 
CANAG-50 specific IgGl kappa ^antibody . the score was 300 
bits (E=5e-79), with 212/233 ■ (90%) identity aligning query 
133-365 to subject 513-^745. 

By BlastX BC010327 was again the highest scorer (233 
bits, E=4e-61), with 117/154 (75% identity) in query 19-480 
vs. subject 122-275- The next best score was to AAA 51630.1 
Ig H-chain (v-region from MPCll) of mus musculus (226 bits, 
8e-59, 75% identity) . Arid the highest human scorer was 
AAB28159.1, corresponding to S657 61. The observed identity 
was 87/157 (55%) . 

For information on gamma 2b transgenic mice,, see Storb, 
et al., Immunol. Res., 13:291-8 (1994), and Ann. NY Acad, 
Sci. 546:51-6 (1988) . 

Northern analysis: Expression in kidneys of bGH- . 
transgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 

Clone A48 (12 month bGH library) 

Insert 430 bp, query sequence 430 bp (SEQ ID NO: 12) 
By BlastN, several sequences produced substantially 

equally significant alignment (E value 0.0): 

BC008237.1 Mus musculus, similar to immunoglobulin 

heavy chain 4 (serum IgGl)-, clone MGC:6522 IMAGE: Z651217 
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mRNA 

BC002121.1 mus musculus similar to immunoglobulin heavy 
chain 4 (serum IgGl), clone mGC:6628 IMAGE:3491766r mRNA 

L35252.1 mouse germ line immunoglobulin gamma-1 (Ighb) 

mRNA 

Best h\iman match was S657 61.1 anticolorectal carcinoma 
heavy-chain=glycoprotein CANAG-50 specific IgGl kappa 
antibody (684 bits, e=0.0, 97%. identity). 

Note that- these were all Ig-related*. 

By BlastX/ the best score was for Genbank AAB59665 
(M60434), score 254, E=le-67. This is an Ig heavy chain 
constant region from Mus domesticus. 116/122 95% identity of 
query 3-368 to subject 227-348. The best human alignment 
was to S65761 anti-colorectal carcinoma (see above) (204 
bits, 3e-53, 94% identity). 

Northern analysis: Expression in kidneys of bGH- 
transgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 

Clone B45 (12 month KT library) 

Insert Size: 610 bp, query sequence 610 bp (SEQ ID 
N0:13) 

By BlastN, the highest scoring similarity was with ref 
NM_010594.1, Mus musculus kidney androgen regulated protein 
mRNA (1170 bits, E=0.0), for 99% identity of query 9-610 
with subject 4-605. Note also the high score for the 
closely related sequence M22810.1 (1168 bits). Best human 
match was to AF319957 Homo sapiens FKSG22 mRNA (1132 bits, 
E=0.0, 99% identity). 

By BlastX, the highest score was for ref NP_034724.1 
and (pl5267) kidney androgen regulated protein (238 bits, E 
of 3e-62), for 100% identity of query 53-415 with subject 1 
121. The rat homologue, ref NP__434689 . 1, scored 103 bits, 
E=le-21. AAG50272.1 corresponds to the best human BLASTN 
match. Homo sapiens FKSG22 mRNA. 

Northern analysis: Expression in kidneys of bGH- 
transgenic mice is less than levels see in age-matched 
nontransgenic control mice and decreases with age. 
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Clone C22 <5 month b6B library) 

query sequence 731 bp (SEQ ID NO: 14) 

By Blastn^ the best alignment is to Genbank 
BC016887.1 Mus musculus clone MGC: 18401 IMAGE; 4242174 mRNA 
and Genbank AF2 6058 0.1/ Mus musculus disabled-2 p96 (Dab2) 
gene/ score of 1243 bits (E 0.0), for 97% alignment of: query 
1-729 to subject 28489-27769. The best human alignment (58 
bits) is to the human homologue/ Genbank AF205890.1 (AFX 
218839S2)/ and AG008846.7/, •E=7e-06, 43/47 identities. . 

No significant Blas't X results, probably in 3' 
noncoding region. 

Either this human sequence, or the much longer mouse 
sequence, could be used as a probe to isolate the full 
length human DNA. 

Dab2 is a widely expressed relative of Dabl/ a neuron-- 
specific signal transduction protein that binds to and 
receives signals from members of the low-density lipoprotein 
receptor family. Dab2 contains a phosphotyrosine-binding 
domain which binds peptides containing the sequence FXN-PXY. 
Dab2 may therefore be both a signal transduction protein and 
an adaptor protein that regulates protein trafficking. See 
Morris, ^^Disabled-2 colocalizes with the CDLR in clathrin- 
coated pits and interacts with AP-2", Traffic, 2(2):lll-23 
(2001) . It also may be a transcriptional activator, see 
Cho, et al., Biochem. J., 352:645-50 (2000). 

This clone was identified as being up-regulated in 
kidneys of 5 month old bGH mice as compared to age-matched 
nontransgenic control mice, but this differential expression 
has not been confirmed by Northern analysis. 

In like manner, additional clones have been identified, 
with Northern analyses (NT, nontransgenic; bGH, transgenic 
for bGH) given below: 



B3: NT>bGH; Unknown (SEQ ID NOs:19, 20) 



B46: NT>bGH; NAS-hypothetical protein-putative (SEQ ID 
NO:33) 
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Example 201 

Use of Soma1:lo Gene Therapy with KAP xdBNA to Protect 
Against Diabetic Nephropathy. 

Specific Aims 

Kidney damage- is a frequent complication of both type I 
and type II diabetes^ often ending in kidney failure, or 
end-stage renal disease (ESRD) . Diabetic nephropathy, the 
single most common, cause of ESRD/ is a progressive disease 
that takes several- years to develop and often goes 
undiagnosed. Our long-term goal is to design specific, 
targeted markers* and therapeutic approaches for the 
diagnosis, treatment, and prevention of human diabetic 
kidney disease. Toward this end, we have identified a gene, 
encoding kidney androgen-regulated protein (KAP) , whose 
expression decreases marTcedly with increasing kidney damage, 
in both a diabetes-dependent and a diabetes-independent 
mouse model of glomerulosclerosis. We believe that this 
,gene, through manipulation of its expression, could provide 
•a means for protecting the kidney from damage. This has' 
tremendous therapeutic potential since KAP is a relatively 
small peptide, is possibly secreted, and appears to have a 
hviman homologue. Based oh our findings, we propose the 
following hypothesis and specific aims . 

Hypothesis: Maintenance of high levels of KAP 
expression will protect the kidney from glomerular 
hypertrophy and diabetic damage. We have observed high 
levels of KAP mRNA isolated from kidneys of nondiabetic 
female mice. In comparison, KAP mRNA levels are 
dramatically decreased in kidneys of streptozotocin (STZ)- 
induced diabetic mice. Furthermore, KAP mRNA levels 
progressively decline with age in nondiabetic bovine growth 
hormone (bGH) -transgenic mice. These mice exhibit 
increasingly severe glomerular hypertrophy and eventually 
glomerulosclerosis, hallmarks of the early stages of human 
diabetic nephropathy. Since low levels of KAP mRNA are 
associated with kidney damage,, we hypothesize that 
maintenance of high levels of KAP expression will protect 
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the kidney from damage. We will test this hypothesis by 
expressing KAP in the kidney using a heterologous promoter 
and then assessing kidney damage after induction of 
diabetes . 

Specific Aim 1: To constitutively express KAP in the 
mouse kidney. KAP expression is normally restricted to 
proximal renal tubule cells and is under the control of 
several different hormones. * In- order to obtain constitutive 
expression, of KAP, the full-length mouse cDNA encoding KAP 
will be placed under the • direction of the kidney-specific 
mouse gamma-glutamyl transpeptidase (GGT) type II promoter 
and transferred into mouse embryos by pronuclear 
microinjection. Individual .lines will be established from 
several founding animals expressing the KAP transgene. In 
anticipation that expression from .the kidney-specific 
promoter might be^lowT additional mouse- lines will be 
generated with the KAP cDNA placed under the direction of 
the stronger mouse metallothionein-I (MT) promoter that 
expresses in several other tissues in addition to the . 
proximal tubuleS. 

Specific Aim 2: To induce diabetes in the KAP- 
transgenic mice and assess the degree of kidney damage. To 
test whether KAP expression can prevent diabetic 
nephropathy, diabetes will be induced in KAP-transgenic mice 
as well as nontransgenic control mice by injection of 
streptozotocin. Glomerular hypertrophy, mesangial cell 
expansion and urinary albumin excretion (UAE) will be 
assessed following a period of hyperglycemia. Observance .of 
kidney measurements for the diabetic KAP-transgenic mice 
that more closely resemble those of nondiabetic rather than 
diabetic nontransgenic controls will support the hypothesis 
that maintenance of KAP expression will protect the kidney 
from diabetic damage. 

Characterization of the kidney androg-en-regulated protein 
gene 

Expression of the KAP gene in mouse kidney is under the 
complex control of several hormones, including androgen. 
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estrogen / thyroid hormone and the growth hormone/insulin- 
like growth factor-I (GH/IGF-I) axis (Cebrian et al., 2001; 
Meseguer and Catterall, 1990; Meseguer and Catterall^ 1992) . 
Expression is confined almost exclusively to epithelial 
cells of the renal proximal tubule (Meseguer and Catterall, 
1987). Expression in the cortical SI and S2 segments is • 
androgen-dependent while expression in the medullar S3 
segment is androgen-independent (Meseguer and Catterall, 
199*0; Meseguer et al-r 1989). Expression of KAP also occurs 
.dn.the mouse uterus, but only during the perinatal period 
(Cebrian et al., 2001; Kasik and Rice, 1993). 

The KAP gene contains four exons and three intervening 
sequences that- span 3807 nucleotides of the mouse genome 
(Niu et al-, 1991) . Its mRNA is .611 nucleotides in length'. 
- (Niu et al-, 1991). KAP gene' regulation occurs primarily at 
the level of .transcription with protein expression closely 
paralleling mRNA expression. The 121 amino acid protein 
product of KAP migrates with an apparent molecular mass of 
20 kDa (Cebrian et al.^ 2001; Meseguer et al., 1989).. A 
hydrophobic N-terminal domain forms a putative 18 amino acid 
signal peptide (Meseguer et al., 1989). The remainder of 
the protein is highly negatively charged and contains amino 
acid clusters similar to those associated with proteins with 
short half-lives (Meseguer et al., 1989). 
Immunohistochemical assays of mouse kidney sections 
identified the protein in the same tissues and at the same 
relative levels as the mRNA (Cebrian et al., 2001). 
Although the function of KAP remains elusive, it has 
recently been determined that the protein specifically 
interacts with cyclophilin B and that overexpression of KAP 
in stably transfected proximal tubule cells significantly 
decreases the toxic effects of cyclosporine A (Cebrian et 
al. , 2001) . 

Importance 

Up to 15.7 million Americans, or approximately 5.9% of 
the population, are thought to have diabetes. While two- 
thirds have been diagnosed, one-third are not aware that 
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they have the disease. Many people first become aware that 
they have diabetes when they develop one of its 
complications^ such as kidney disease, heart disease, nerve 
disease or blindness. Ten to twenty-one percent of all 
people with diabetes eventually develop kidney disease. 
Diabetic nephropathy is the most common cause of ESRD, a 
condition that requires dialysis or a kidney transplant for 
survival. Approximately 40% of the new cases of ESRD are 
attributable to diabetes, and this incidence is* increasing 
more rapidly than any other cause. 

Amino acid sequence analysis suggests that KAP may be a 
small, secreted protein (Meseguer et al., 1989). In 
addition, a human cDNAhas been identified that is 99% 
identical to the mouse cDNA at the nucleotide level and 100% 
identical at the amino acid level. If shown to have- 
protective abilities, KAP could be utilized as a therapeutic 
drug for the prevention of diabetic. nephropathy in humans. 
Preliminary Studies 

We have utilized two different mouse models that 
.develop kidney damage by presumably independent means, the 
streptozotocin. (STZ) -induced type I diabetic mouse and the • 
MT/bGH-transgenic mouse, A PCR-based method of subtractive 
hybridization was used to create libraries that contained 
cDNAs either up-regulated or down-regulated in the damaged 
kidney in comparison to the undamaged control kidney. A 
cDNA encoding KAP was identified in the libraries of down- 
regulated cDNAs from both mouse models of kidney damage. 
Data describing the assessment of kidney damage and showing 
the down-regulation of the KAP gene in the two mouse models 
are presented below. 

Analysis of kidney damage in streptozotocin-induced diabetic 
mice 

Diabetes was induced by multiple injection of STZ as 
described by Bellush et al. (2000). In short, 2-month-old 
female 129 Ola/BalbC mice received daily injections of 85 mg 
STZ/kg body weight until their blood glucose levels were at 
least 250 mg/dl. Control mice received a similar number of 
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injections of citrate buffer without the STZ. Within one 
week/ most of the STZ-injected (DB) mice had blood glucose 
levels of 300-500 mg/dl that were maintained throughout the 
duration of the study. The blood glucose levels of 
nondiabetic (ND) control mice remained below 100 mg/dl. 
Mice were sacrificed 4 or 10 weeks after the onset of 
diabetes. Several parameters reflecting the extent of 
kidney damage were. assessed. Glomerular hypertrophy was 
observed in the DB mice in comparison with the "ND mice at 
both the 4 and 10 wk timepoints. (Fig. Ih) A significant 
elevation of urinary albumin excretion (UAE) was observed 
only at the 10 wk timepoint- (Fig. IB) Both changes are 
indicative of kidney damage. Measurements of mesangial cell 
proliferation, another indicator of damage, are in progress. 



Decreased KAP mRNA expression in diabetic kidneys ■ 

In an effort to identify genes involved in the 
progression of diabetic nephropathy, we performed a PCR- 
based method of subtractive . hybridization using RNA 
extracted from kidneys of streptozotocin-induced (type I) 
diabetic female mice and nondiabetic .controls collected 10 
weeks after the establishment of diabetes. Two cDNA 
subtraction libraries were created, one (DB) containing 
cDNAs up-regulated in the diabetic kidney in comparison to 
the. nondiabetic kidney, and one (ND) containing cDNAs down- 
regulated in the diabetic kidney. cDNAs exhibiting high 
sequence similarity to the KAP nucleotide sequence were 
isolated multiple times from the library of down-regulated 
cDNAs. Northern analysis confirmed the decreased expression 
in the diabetic kidney. 

Analysis of kidney damage in MT/bGH-transgenic mice 

Expression of the bovine growth hormone (bGH) cDNA 
under the direction of the metallothionein-I promoter (MT) 
results in a giant mouse that is approximately 1.5-2 times 
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the size of nontransgenic mice {Chen et al., 1990; Quaife et 
ai.r 1989). These MT/bGH mice develop severe kidney damage 
and die prematurely (Quaife et al., 1989). We have assessed 
several parameters reflecting the extent of kidney damage in 
these mice as a function of age. Significant increases in 
glomerular volume (as determined by Analysis of Variance 
(ANOVA) ; P<0.05) were observed in the MT/bGH mice in 
comparison with the nontransgenic. (NT) mice at the 2 and 5 
month timepoints but not at 12 months (Fig. 2A) . A 
significant increase in mesangial cell area was observed 
only at the 12 month timepoint (Fig. 2 B) . A significant 
elevation of urinary albumin excretion (UAE) was also 
observed only at the 12 month timepoint (Fig. 2C) . Light 
microscopic images of NT and bGH kidney sections show 
glomerular hypertrophy in the bGH kidney sample along with a 
thickened Bowman's capsule. The observed changes seen in 
the bGH mice are indicative of kidney damage. 

Decreased KAP mRNA expression in the kidneys of MT/bGH- 
transgenic mice 

In an effort to identify genes involved in the kidney 
damage resulting from expression of the bGH transgene, we 
performed a PCR-based method of subtractive hybridization 
using RNA extracted from kidneys of 12-month-old MT/bGH- 
transgenic mice and their nontransgenic littermates- Two 
cDNA subtraction libraries were created^ one (bGH) 
containing cDNAs up-regulated in the MT/bGH kidney in 
comparison to the nontransgenic kidney, and one (NT) 
containing. cDNAs down-regulated in the MT/bGH kidney., A 
cDNA exhibiting high sequence similarity to the KAP 
nucleotide sequence was isolated from the library of down- 
regulated cDNAs. Northern analysis confirmed the decreased 
expression. in the MT/bGH kidney as a function of age that 
correlated with degree of kidney damage. 



Research Design and Methods 

We have demonstrated that KAP mRNA expression is 
inversely correlated with kidney damage (i.e. KAP expression 



wo 2004/013289 



109 



'CT/US2003/0240S3 



decreases with increasing kidney damage) . This decrease in 
KAP gene expression could either be the cause of the kidney 
damage or could result from the kidney damage. One way to 
test the former possibility would be to knock out expression 
5 of the KAP gene and assess whether kidney damage occurs 

independently of diabetes. In a converse experiment^ high 
levels of KAP expression could be maintained and the kidneys 
assessed for protection from damage when diabetes is 
induced. The second experiment would further • support KAP as 

10 a therapeutic drug for the prevention of diabetic 

. nephropathy if shown to be protective while the first 
•experiment would further support KAP as a drug target for 
the prevention of kidney damage. Due to the practical, 
outcome of the second experiment, we propose to test the 

15 hypothesis that maintenance of high levels of KAP expression 
will protect the kidney from glomerular hypertrophy and 
diabetic damage. To do this, we will create a transgenic 
mouse line that expresses • KAP in the kidney under the 
direction of a heterologous promoter such that KAP 

20 expression will no longer decrease with increasing kidney 
damage. We will then challenge the kidney by inducing 
diabetes in the KAP-transgenic mouse and assessing the 
degree of kidney damage. The hypothesis will be considered 
proven if the kidneys of diabetic KAP-transgenic mice more 

25 closely resemble nondiabetic rather than diabetic 

nontransgenic control kidneys. Confirmation of the 
hypothesis would further support of KAP as a therapeutic 
drug to prevent diabetic nephropathy. 

30 Specific Aim 1: To constitutively express KAP mRNA in the 
mouse kidney 

Rationale 

It has been demonstrated that KAP mRNA levels decrease 
35 with increasing kidney damage. In order to prevent this 
decrease in expression, the KAP cDNA will be placed under 
the control of a heterologous promoter that should not be 
affected by the state of diabetes (use of the term 
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^^promoter" in this proposal includes the region of the gene 
required to initiate transcription as well as regions that 
•regulate expression). Although a n\ainber of genes are 
expressed in the renal proximal tubule, very few have been 
shown to be expressed only in those cells. Furthermore, 
very few of the promoters of these genes have been 
characterized well enough to demonstrate cell-specific 
expression of a reporter gene. A search of the literature 
has indicated two different promoter regions that may 
provide suitable transcriptional • regulation . The first, a 
34 6-base pair region from the type II promoter of the mouse 
gene encoding gamma-glutamyl transpeptidase (GGT) , has been 
shown to direct expression of a B-galactosidase reporter 
gene specifically in the renal proximal tubules of 
transgenic mice (Sepulveda et al., 1997). Although the GGT 
-promoter has* the" correct tissue-specific expression, there 
is some question as to whether the level of expression will 
be adequate. The second promoter selected is from the mouse 
metallothionein-I gene. Although it was reported to direct 
expression of human growth hormone-releasing hormone in 
renal proximal tubules, it also directed expression. in a 
number of other tissues, including pancreas, lung, "heart and 
brain (Brar et al., 1989). Since an ^'ideal'' promoter has 
not been found, both of these promoters will be used and the 
results compared. 

. Experimental design 

a) Construct a. plasmid fusing the KAP cDNA to a 
heterologous promoter that will drive KAP expression in the 
kidney. An expression plasmid routinely used in our 
laboratory will serve as a starting point for constructing 
the heterologous promoter/KAP cDNA fusion. This plasmid 
utilizes the mouse metallothionein-I promoter to initiate 
transcription. The plasmid also has a bGH polyadenylation 
signal sequence that defines the site of poly A addition. • 
The 611 bp cDNA for KAP, isolated from our subtraction 
libraries, will be inserted between the promoter and the 
polyadenylation signal sequence, generating the MT/KAP 
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fusion construct. Nucleotide probes complementary to the 
bGH sequence just upstream of the poly A addition site will 
be used to distinguish transgenic KAP mRNA expression from 
endogenous KAP mRNA expression. A protein tag will not be • 
added to the protein sequence since, without a known 
function for KAP, it will be impossible to know if the tag 
interferes with function. The.plasmid also has a bacterial 
origin of replication and an ampicillin resistance gene for 
propagation in B. coil- Replacement of the MT promoter 
region with the 34 6 bp .GGT' promoter. (Sepulveda et al., 1997) 
will generate the GGT/KAP fusion • construct • All manipulated 
regions of the expression plasmids will be sequenced to 
confirm that no errors were introduced, 

b) Confirm mRNA and protein expression from the fusion 
construct In cell culture. Before injection into mice, 
expression from each KAP fusion construct will* be confirmed 
in cell culture. Mouse L cells, used routinely in our 
laboratory, will be transiently transfected with the KAP 
fusion constructs using lipofectin. Transfected and 
nontransfected control cells will be harvested and either 
RNA or protein extracted. RNA will be analyzed by Northern . 
blot hybridization as described in the Preliminary Results 
section using a probe complementary to the bGH sequence at 
the 3' end of the KAP transgene message. Protein will be 
analyzed for KAP expression by Western blot analysis using 
monoclonal antibodies as described by Cebrian et al., 2001 
(2001) . Since a protein tag will not be utilized for the 
transgene, protein expression paralleling the transgenic 
mRNA expression and differing from normal expression will be 
evaluated.' In the event that the GGT promoter is not 
functional in mouse L cells, a mouse proximal tubule cell 
line will be tried (Sepulveda et al., 1997). Our laboratory 
has considerable experience performing Northern and Western 
blot analysis - 

c) Create transgenic mouse lines carrying the KAP 
transgene. The pronuclear microinjection technique into 
single-cell fertilized embryos will be utilized to deliver 
the KAP transgene into the mouse genome. Our transgenic 
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mouse facility has been using the technique since the mid- 
1980s (McGrane et al.^ 1988). Since then, our facility has 
optimized the technique and created hundreds of transgenic 
lines for Ohio University researchers as well as outside 
researchers. Pups obtained from the microinjections will be 
screened for the presence of the KAP transgene by DNAslot 
blot and Southern- blot hybridization. The number of 
integrated KAP transgene .copies will be quantitated by 
comparison to known copy .number standards. Transgenic lines 
will be established by crossing the transgenic founders to • 
nontransgenic mice and inheritance of the KAP transgene 
confirmed by DNA slot blot hybridization. These techniques 
are performed routinely in our laboratory. 

d) Confirm mRNA and protein expression from the fusion 
.construct in the KAP-transgenic mouse- lines. Fl progeny 
from the KAP transgenic lines will be assayed for KAP 
transgene expression. Transgenic mice and their 
nontransgenic littermates will be sacrificed by cervical 
dislocation and several organs (kidneys, pancreas, lu|cig, 
heart and brain) immediately frozen. RNA or protein will be 
extracted from the tissues and then analyzed by Northern or 
Western blot analysis as described above. Tissues other 
than kidney will be included in order to determine the 
degree of tissue specificity. Lines expressing the highest 
levels of KAP from the transgene will be expanded for 
further analysis (see below) . 

Analysis 

The experiments proposed for Specific Aim 1 should 
result in generation of two different transgenic mouse lines 
that express KAP in renal proximal tubules. However, 
several potential problems could be encountered. First, the 
selected promoters may not direct sufficient or appropriate 
expression. One way to increase expression from the MT 
promoter is to supplement the mouse drinking water with ZnSO 
(Eisen et al., 1998). Other promoters that could be 
considered include the mouse phosphoenolpyruvate 
carboxykinase (PEPCK) promoter or the cytomegalovirus (CMV) 
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promoter. Although the PEPCK promoter has been shown to 
direct expression in renal proximal tubules^ as well as 
other tissues (Short et al.^ 1992), it was questionable as 
to whether induction of diabetes would influence its ability 
to direct transcription. Likewise, although CMY does direct 
expression in the kidney, it was not clear whether 
expression occurred in the proximal tubules . 

Another potential concern .is use- of the KAP cDNA for 
expression. * Sometimes we. find that inclusion of an intron * 
increases expression. Alternative constructs could include 
the 126 bp second intron of KAP or .the 3807 bp genomic clone 
(Niu ef al., 1991) . 

Specific Aim 2: To induce diabetes in the KAP-transgenic 
mice and as"sess* the degree of kidney damage 

Rationale 

In order to determine if KAP mRNA expression (and 
subsequent protein production) can protect the kidneys of 
diabetic mice from damage, diabetes* will be induced in the 
KAP-transgenic mice generated under Specific Aim 1. After 4 
to 10 weeks of diabetes, diabetic and nondiabetic KAP- 
transgenic and nontransgenic mice will be sacrificed and 
their kidneys examined. If the parameters measured for the 
diabetic KAP-transgenic mice (glomerular volume, mesangial 
cell expansion, and urinary albumin excretion) are 
significantly dif f erent . f rom diabetic nontransgenic mice and 
in fact more closely resemble those for nondiabetic, 
nontransgenic mice, then the hypothesis will be considered 
proven with evidence suggesting the use of KAP as a . 
therapeutic drug for the prevention of diabetic kidney 
damage . 

Experimental design 

Diabetes will be induced in twenty-five 2-month-old 
KAP-transgenic and nontransgenic littermate females by 
multiple injection of STZ as described in the Preliminary 
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Results section. Nondiabetic controls (twenty KAP- 
transgenic and twenty nontransgenic littermates) will be 
injected with citrate buffer only. After 4 or 10 weeks of 
hyperglycemia (blood glucose levels 300-500 mg/dl) , urine 
will be collected from diabetic KAP-transgenic mice and the 
three control groups (diabetic nontransgenic ^ nondiabetic 
KAP-transgenic and nondiabetic nontransgenic; 8-10 
mice/group). The itdce will be sacrificed by cervical 
dislocation and tissues (kidneys, pancreas, lung, heart, 
brain, and liver) immediately removed. • The right kidney 
will-.be quickly frozen for later .RNA and protein extraction 
and the left kidney will be bisected and placed in 4% 
paraformaldehyde. RNA and protein will be examined by 
Northern and Western blot analyses to determine levels of 
transgenic and endogenous KAP expression as described above. 
Glomerular volume, mesangial cell expansion ""and 'UAE will be 
measured as described in the Preliminary Results section. 
Statistically significant differences will be determined by 
ANOVA. 

Analysis 

Performance of Specific Aim 2 should establish whether 
expression of KAP in kidney using a heterologous promoter 
can protect the kidney from diabetic damage. If KAP 
expression protects the kidney, then the measurements of 
glomerular volume, mesangial cell expansion and UAE should 
not differ significantly between diabetic, KAP-transgenic 
mice and nondiabetic, nontransgenic mice. Instead, there 
should be a significant difference between diabetic, KAP- 
transgenic mice and diabetic, nontransgenic mice. 

One potential problem* that might be encountered is an 
effect of diabetes on expression by the heterologous 
promoters. Northern and Western blot analyses of several 
different tissues, including kidney, should help determine 
if this is a problem. Also, it is unlikely that both 
promoters will be affected. 

Another possible scenario is that expression of the 
KAP-transgene will affect the measured kidney damage 
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parameters (glomerular volume, mesangial cell expansion and 
UAE) differently. For example, diabetic, KAP-transgenic 
mice may no longer exhibit mesangial cell expansion and 
increased UAE (parameters that typically change during the 
later stages of diabetes in diabetic, nontransgenic mice; 
see Preliminary Results section), but may exhibit glomerular 
hypertrophy. This differential effect may help pinpoint the 
role of KAP in the kidney. Interestingly, in contrast to 
our results, KAP mRNA was reported as being slightly 
increased (1.5 X) in remnant kidneys exhibiting glomerular 
hyperfiltration two weeks after undergoing subtotal renal 
ablation (5/6 nephrectomy) (Zhang et al., 1999). A possible 
explanation is that KAP expression increases during the very 
early stages of diabetes (prior to our earliest timepoint, 
perhaps in an- effort to protect the kidney) but then 
decreases as diabetes progresses. 

Another potential problem is that the heterologous 
promoters may not be* able to drive sufficient expression of 
KAP for it to protect against diabetic damage. KAP appears 
to be highly expressed in our female mice, as shown in the 
Preliminary Results section. We have not assayed KAP 
expression in our male mice, but reports in the literature 
indicate that KAP ekpression is highest in males and lower 
in females and castrated males (Cebrian et al . , 2001). 
Achievement of even higher levels of KAP may require 
characterization of a similarly expressed gene promoter, 
such as that for the gamma phosphorylase kinase gamma- 
subunit (Takenaka et al . , 1998). 

The possibility also exists that increased KAP 
expression will not protect the kidney from damage, but that 
the decreased expression results from the development of 
kidney damage. If this appears to be the case, then 
decreased KAP expression could potentially be used as a 
diagnostic marker for kidney damage. 
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Future direction 

As demonstrated in the Preliminary Results section, KAP 
mRNA expression also decreases with increasing kidney damage 
as a function of age in MT/bGH-transgenic mice. Since these 
mice do not become diabetic, their kidney damage presumably 
occurs via a different mechanism. Thus, they provide an 
independent model for the study of progressive glomerular 
hypertrophy and glomerulosclerosis. Although completion 
lies beyond the time frame of this proposal, we hope to 
initiate a study that combines the bGH transgene and the KAP . 
transgene in a double-transgenic mouse through genetic 
breeding. The double-transgenic mice, and single-transgenic 
controls, will, then be sacrificed at 2, 5, or 12 months of 
age and their kidneys examined for ECAP transgenic and 
endogenous expression and .for glomerular hypertrophy and 
mesangial cell expansion as described for Specific Aim 2. 
Data demonstrating that KAP expression protects the kidney 
from damage due to the bGH transgene would further support 
and even broaden the role of KAP in protecting the kidney 
from damage. 

Adaptation to Human Therapy 

To adapt the gene therapy system for humans, we could 
use somatic cell gene therapy. Any vector that delivers DNA 
into the nucleus (i.e. viral or non-viral methods) as well 
•as any promoter that could direct constitutive, high levels 
of expression could potentially be used. Since KAP may be a* 
secreted protein, it may not be important to direct 
expression only to kidney cells. Although we anticipate 
that KAP would protect the kidneys, it could possibly 
protect other organs as well. KAP would preferably be 
administered/maintained as long as the patient is diabetic 
and susceptible to kidney damage. Administration route 
could be any of the following: subcutaneous, intravenous, 
intramuscular . 

KAP could potentially be used in humans as a 
^'conventional'' pharmaceutical, especially if it is a 
secreted protein and thus would normally be found 
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circulating throughout the body. The administration route 
could be any that would keep the protein intact (e.g. 
subcutaneous / intravenous, iiitraniusGular) . 
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Citation of documents herein is not intended as an 
admission that any of the documents cited herein is 
pertinent prior art^ or an admission that the cited 
documents is considered material to the patentability of any 
of the claims of the present .application. All statements as 
to the date or representation as to the contents of these 
documents is based on the information available to the 
applicant and does not constitute any admission as to the 
correctness, of the dates or contents- of these documents. 

The appended claims are .to- be treated as a non-limiting 
recitation of preferred .embodiments . • 

In addition to those, set forth elsewhere^ the following 
references are hereby incorporated by reference^ in their 
most recent editions as of the time of filing of this 
application: Kay^ Phage Display of Peptides and Proteins: A 
Laboratory Manual/ the John Wiley, and Sons Current Protocols 
series r including Ausubelr Current Protocols in Molecular 
Biology; Coligan^ Current Protocols in Protein Science/ 
Coligan^ Current Protocols in Immunology/ Current Protocols 
in Human Genetics/ Current Protocols in Cytometry/ Current 
Protocols in Pharmacology/ Current Protocols in 
• Neuroscience/ Current Protocols in Cell Biology/ Current 
Protocols in Toxicology/ Current Protocols in Field 
Analytical Chemistry/ Current Protocols in Nucleic Acid 
Chemistry/ and Current Protocols in Human Genetics/ and 
the following Cold Spring Harbor Laboratory publications: 
Sambrookr Molecular Cloning: A Laboratory Manual/ Harlow^ 
Antibodies : A Laboratory Manual/ Manipulating the Mouse 
Embryo: A Laboratory Manual/ Methods in Yeast Genetics: A 
Cold Spring Harbor Laboratory Course Manual/ Drosophila 
Protocols/ Imaging Neurons: A Laboratory Manual/ Early 
Development of Xenopus laevis: A Laboratory Manual/ Using 
Antibodies : A Laboratory Manual/ At the Bench: A Laboratory 
Navigator/ Cells: A Laboratory Manual/ Methods in Yeast 
Genetics: A Laboratory Course Manual/ Discovering Neurons: 
The Experimental Basis of Neuroscience/ Genome Analysis : A 
Laboratory Manual Series / Laboratory DNA Science/ 
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Strategies for Protein Purification and Characterization: A 
Laboratory Course Manual/ Genetic Analysis of Pathogenic 
Bacteria: A Laboratory Manual/ PGR Primer: A Laboratory 
Manual/ Methods in Plant Molecular Biology: A Laboratory 
Course Manual / Manipulating the Mouse Embryo: A Laboratory " 
Manual/ Molecular Probes of the Nervous System/ Experiments 
with Fission Yeast: A Laboratory Course Manual; A Short 
Course in Bacterial Genetics : A Laboratory Manual and 
Handbook for Escherichia coli and. Related Bacteria; DMA 
Science: A- First Course' in. Recombinant DNA Technology; 
Methods in Yeast Genetics: A Laboratory Course Manual/ 
Molecular Biology of Plants: A Laboratory Course Manuals 

. All references cited- herein, including journal articles 
or abstracts r published, corresponding, prior or otherwise 
related U.S. or foreign' patent applications, issued U.S. or 
foreign patents, or any other references, are entirely 
incorporated by reference -herein, including all data, 
tables, figures, and text presented in the cited references. 
Additionally, the entire contents of the references cited 
within the references cited herein are also entirely 
incorporated by reference. 

Reference to known method steps, conventional methods 
steps, known methods or conventional methods is not in any 
way an admission that any aspect, description or embodiment 
of the present invention is disclosed, taught or suggested 
in the relevant art. 

The foregoing description of the specific embodiments 
will so fully reveal the general nature of the invention 
that others can, by applying knowledge within the skill of 
the art (including the contents of the references cited 
herein) , readily modify and/or adapt for various 
applications such specific embodiments, without undue 
experimentation, without departing from the general concept 
of the present invention. Therefore, such adaptations and 
modifications are intended to be within the meaning and 
range of equivalents of the disclosed embodiments, based on 
the teaching and guidance presented herein. It is to be 
understood that the phraseology or terminology herein is for 
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the purpose of description and not of limitation^ such that 
the terminology or phraseology of the present specification 
is to be interpreted by the skilled artisan in light of the 
teachings and guidance presented herein^ in combination with 
5 the knowledge of one of ordinary skill in the art. 

Any description of a class or range as being useful or 
preferred in the practice of the invention shall be deemed a 
description of any subclass fe.g. , a disclosed class with 
one. or more disclosed members omitted) or subrange contained 

10 therein^ as well as a separate description of each 
individual member or value in said class or range. 

The description .of preferred embodiments individually 
.shall be deemed a description of any possible combinsttion of 
such preferred embodiments^ except for combinations which 

15 • are impossible fe.g, mutually exclusive choices for an 

element of the invention) or which are expressly excluded by , 
this specification. 

If an embodiment of this invention is disclosed in the 
prior artr the description of the invention shall be deemed 

20 to include the invention as herein disclosed with such 
embodiment excised.' 
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Table 1: Sequence of Clone G26, related to hydroxysteroid 
dehydrogenase-4 

lA: beginning of coding strand (SEQ ID NO:l) 

5 

actcagctcctacnggntgcttgtgtggaagccaacgtgccaacattcatctacagcagc 
tcagtgcttgtggctggaccaaattcctacaaggagatcatcctgaatggccatgaggag 
gagcatcatgaaagcacatggtctaacccatacccatgnnnngtgaaagatggctgagaa 
ggcagtgctggcaacccntgggagcatcctgaaaaatggtggcactttgcatacttgtgc 
. 10 cttaagactctctttcatctatggggaagaatgccaagtcacttcaaccactgtgaaaac 
agcactgaagaacaacagcataattaagaaagatgccacattctccattgccaacccagt 
. gtatgtgggcaatgcancctgggctcacattctggctgccaggagcctacaggaccctaa 
aaagtccccaagcatccaaggacagttctattacattaacagatgacacccctcaccaaa 
gcnatgatgacttnaaaatgcaccctgagcaaggagtgggggcccncccccttganacca 
15 actgggaagtctttcctcrfcgccccctgctntactnggcnttgccttncctccccnggaaa 
.actggggaaccttcctgctaccnnccannctaccaacttannnaaccccnttttaaaccc 
cccnttggaccnccanngnnaaaaaaangnnntncncccnntcctaccaaaaaggcncaa 
• cnnnnaacnnggcnttnancnnnttng 

20 IB: end of coding strand, obtained by sequencing opposite 
strand, deducing complement, and arranging in 5* -3' order 
(SEQ ID NO: 2) 

ngggcttncggcccnnananntcccaagcnnccaggnnggtttnttcctttcnangnncc 
25 , cctcaccaaaagtatgntnatttaaaagacccttgancaaagantggggccttngctttg 
atacanntggagtntttctutggcccnnctncnatnggcntgctttcntcttnaaactnt 
gagnntccnggntacgtccagtnttncaaaataaaancccgcccatttaaccgcctcttg 
atcacagtgnttaaanagtgatccacctctcctanaagaaggtcagngagatttgggcta 
tgagccaacttgtcagctgggaggnagccaagcagaaaacctcagagtggatcggaaaca 
30 ctagtgatgcagcacagggagattggaaacaaaaagtcncagtgatatgaagagggagag 
gacatggccctgggtgttattaggtcctccagaaagggacttagaaacaatccaactctt 
aacaattccattttacactctgtccaacttgtctttgggtcaccagaagccttgcaagtc 
actggcccagttgcaacccttcagctntaagcacttgtccagcaatgcacaagatgtgcc 
tcagctgcngtgacccaaggatgggtggctgatagtgagttgcctggaacctcttgtagg 
35 ttagaatttcatcnnggcttccacatctcctttccatgtgccaacgcatttcgtgtctga 
gaaaattcccatacctttatgaagctcaaagaatagaacacataaaatcttttaatgcgt 
aaaaaaaatnaaaaaaaaa 



40 

TaJble 2: Sequence of Clone H8 (SEQ ID NO:3), related to 
vacuolar adenosine triphosphatase subunit D; 

acttcaatgtggaccatggctacctggagggcctggttcgaggatgcaaagccagcctcc 
45 taactcagcaggactatgtcaacctagtgcagtgtgagacctcggaagacctgaaaattc 
atctccagaccacggactatggcaacttcctggctaatgaaacaaatcctctcactgttt 
ccaaaattgacacggagatgaggaagaagttctgcagagagtttgactatttccggaatc 
attccttggagtccctgagcacatttctcacctacatgacatgcagctatatgatagaca 
atataattctacttatgaatggggccttgcaaaagaaatctgtgaaagaagttctagcca 
50 agtgtcacccactgggccgtttcacagagatggaagctgtcaacattgcagagaccccct 
cagatctcttcaggcttgtgctggttgaaacaccattagctccattctttcaagattgta 
tgtcttgaaaacacctcttgatgaacttgaatattgaattactgcgcaataaactataca 
agtcttaccttgaggcattctacaaattctgcaaggatcacggtgatgtcacagcagacg 
ttatgtgtcccattcttgagtttgaggccgacagacgcgctttaatcatcactctgaact 
55 catttggcactgaactaagcaaagaagacagggagaccctcttccccacctgcggcaggc 
tctatccagaggggttgcggttgttagctcaagctgaagactttgagcagatgaagagag 
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tggcagataattatggagtttacaagcctttgtttgactgctgtcggtggcagtgggggg 
aagacactggaagacgttttctatgagagagaggt 

Table 3: Sequence of Clone HI (SEQ ID NO:4), related to 
ubiquitin protein ligase Nedd-4 

acatacgtgtttacggagttcattatgtttacagattaagcgaatttctgtagttgcatt 

tttatatttttagtatcacattagtataaaaatttgtttaaaatagccaaagagtagttc 

aatgcataatttgtatgaatttgttaccaagtttcttatgctttctaaaaaatactgttt 

cctatgaaaattatgtttaatcaaaagtcaagaacccttggggcaaaatgctacaggtgg 

agtcccaccatagtcacctttgaggcacaggaagaggttccatgcatgaatctgcacaca 

tgagcagtgctggctgctgtgcaccgggcccttgggatgaagccatcagcactgacatcc 

agccccgtettcatcctcaggcttagaaacaagtgggatggtgactttacagtcagtggg 

aagtaagagcccgcagaataacaaaggcagaaggacaggaaagcgaggtccgagtccgac 

tgtgaggtcatatctggccaaggctccttctagtcagcttacactggtgattatacggaa 

aatgctcttagatatgagaaaagcaatgtggcatgtaatgagttagtgtaagcttgtgtt 

ataaactttccgaatgcttataagtagcctgtgggagctgtaagaaatacttagggccat 

tgatttaagcttgttggctctaacatgactaatacctgtcagcccatggtagggatttca 

gtgacagtagtggaaacgctttccacagtccacaccactatgttgatcggtgaaagacac 

ccccaggccagtgcatgcttccagagtgtcctgatcagtcagccaggcatggGcaccgtc 

cagtgttgatttttatcattagccagctgagcctgcagcacggggctcatttcccogtct . 

ggcagatgtggatagctgggcccactcattctgtctctcaaggctatggtaagttcttaa 

-aggggt 

Table 4: Sequence of Clone Fl (SEQ ID NO:5), related to 
truncated SON protein.. 

acaagcccgacttgaaagaggcctccagaaaaagtagatgcgtatctgtgcaaacagatc 

ctactgatgaagtgcccaccaaaaagtcaaagaagcataaaaagcacaaaaacaaaaaga 

agaaaaagaagaaagaaaaagaaaaaaaatataaaagacaaccagaagaatctgagtcca 

agctgaaatctcatcatgatgggaatctagaatctgattccttcttaaagtttgattctg 

aaccttcagcagcggcact-ggaacatcctgtaaga'gcgtttggettatctgaggccagtg 

agaccgccttagtgctgga-acctccagtagtctcaatggaggttcaggagtcacacgttc 

tagagactctgaagccagctacaaaagctgcagaactgtcagttgtgtctacatcagtaa 

tctcagagcagtctgagcagccgatgccaggtatgctggaaccatcaatgacaaagatcc 

.tggattccttcacagcagcaccagtggcctatgtcaacagcggcgctgaagtcacctgag 

ccccgttgtaacaaatgtcagtggaatatcagaagtctgtgctgaaatctttggagacta 

tgcctccagagacgtGaaagaccacgctggtagagcttcccatagcaaaagtggttgagc 

catcagaaaccctcacgatagtgtcagagacacctactgaggtgcaccctgaaccaagcc 

catcaacaatggattttccagagtcatctacaactgacgtgcaaagattgcqagagcagc 

ctgtagaagcaccatcggagattgcagattcatccatgacaagacctcaggagtcactgg 

agctgcctaagaccacagcggtggagctgcaggagtcaacggtggcctcagctctggagt 

tgccggggccacctgcgacctccattctggagttgcaggggccccctgtgact'ccagtgc 

cagagttgcctgggccctctgccaccccagtgccagagttgtcagggcccctttctaccc 

cagtgcctgagttgccagggccccctgcgacagtggtccctgagttgccggggccctctg 

tgacaccagtgccacagttgtcgcaggaattgccagggcctccagcaccatccatggggt 

tggagccaccacaggaggt 

Table 5: Sequence of Clone F4 (SEQ ID NO:6), related to 
kidney androgen regulated protein 

acgcggggtaatttctttctcctgttcactggttgccttaaccctactaaagcatgatgc 
ttttcaaggtcctggtgatcactgtctcctgtggtctgactgtggctttccccctgtcag 
aattagtttcaatcaataaagaactacagaattcaatcattgacctactaaactcagtct 
ttgaccaactgggatcatacagagggacaaaagctcctctagaggattatacagatgatg 
atttaagcactgactctgagcagatcatggacttcacgccagccgcaaacaaacagaatt 



• 
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ctgagttctctactgatgttgagacagtctcctccggctttctggaagaattcactgaga 
acacagacatcacagtgaaaattccattagctgggaatccagtctcccctacttcctgag 

aagactccaactttgaaataaacctgcgactgccattcaggatattgtctttgtagaaaa 
5 tgatataaatctaagatgctcccacaaagaagaaagaacgaggaagtgatcctggatgag 
aaatgctgtgccaggcatctctaagctctacccttactntgtgtgggtttaagaaataaa 
ccgttcatttgattgc 



10. • 

Table 6: Sequence of Clone F5 (SEQ ID N0:7)^ related to 
claudin-10 

acgcggggnagtagggaattagaaagattattctaagctatctctaagatgcccttagtg * 
.15 tgatatactttattctaatttcccacacttcaagccatgagattatgaagaattgtcacc 
• .tac.cctcatgggctggctttaacattcgtggaagttttgtccagtgtctgcatgcctagg 
.ccaccaaagcc.ttctgtgtggactgtggcagcaggcaaggctgagcgacatgtccagggc 
acagatctcagctctggtgtgtggtgttggagggtttggtgctctcgtcgctgccaccac 
atccaacgaatggaaagtgaccacccgagcgtcgtctgtgattaccgccacctgggttta 
20 . ccagggtctgtggatgaactgcgcaggtaacgctctgggctccttccactgccggccaca 
tttcactatcttcaaagtagaaggttacatccaggcatgtagaggactaatgatcgctgc 
ggtcagcctgggatttttcggttccatttttgcactctttggaatgaaatgtaccaaagt 
cggaggctcagatcaagccaaagctaaaatccgcttgctttggcccgggattggattcat 
attgncaggtctggggttccatgacaggctgttcccttgtntgcaaccaaatcncaacng 
25 aattctttgatcctntttatatggagcaaaagtttgaattaggggcttgtttttttatcg 
gatgggcngaacttttttntgntcatgggggagnatatttggttttaatatccacacacc 
aacnccaaaagggtacnntcaanggncccnttngcatgtttttggaccagtttaggn-gna 
aagaaaaaaaaaaaaaaaaaancctngccnancctagggan 

.30 



Table 7: Sequence of Clone F6 (SEQ ID N0:8) , related to 
kidney androgen regulated protein 

35 

aacttctttctcctgttcactggttgccttaaccctactaaagcatgatgcttttcaagg 
tcctggtgatcactgtctcctgtggtctgactgtggctttccccctgtcagaattagttt 
caatcaataaagaactacagaattcaatcatcgacctactaaactcagtctttgaccaac 
tgggatcatacagagggacaaaagctcctctagaggattatacagatgatgatttaagca 

4G ctgactctgagcagatcatggacttcacgccagccgcaaacaaacagaattctgagttct 
• ctactgatgttgagacagtctcctccggctttctggaagaattcactgagaacacagaca 
tcacagtgaaaattccattagctgggaatccagtctcccctacttcctgagaagactcca 
actttgaaataaacctgcgactgccattcaggatattgtctttgtagaaaatgatataaa 
tctaagatgctcccagcaaagaagaaagaacgaggaagtgatcctggatgagaaatgctg 

45 tgccaggcatctctaagctctacccttactctggggtgggtttaagaaataaacgttcat 
tttgaattccc 



50 Table 8: Sequence of Clone A8 {SEQ ID NO: 9) related to an 
immunoglobulin sequence 

acgcgggtgagcagcaccctcacattgaccaaggacgagtatgaacgacataacagctat 
acctgtgaggccactcacaagacatcaacttcacccatcgtcaagagcttcaacaggaat 
55 gagtgttagagccaaaggtcctgagacgccaccaccagctccccagctccatcctatctt 
cccttctaaggtcttggaggcttccccacaagcgacctaccactgttgcggtgctccaaa 
cctcctccccacctccttctcctcctccctttacttggcttttatcatgctaatatttgc 
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agaaaatattcaataaagtgagtctttgcacttg 



Table 9: Sequence of Clone A34 (SEQ ID NO:10)/ related to an • 
immunoglobulin sequence 

• acgttgtaaaacgacggccagtgaattgtaatacgactcacttatagggcgaattgggcc 
10 ctctagatgcatgctcgagcggccgccagtgtgatggatatctgcagaattcgccctttc 
. gagcggccgcccgggcaggtacagtgtgggtttaccagtggacttgtccacggtcctctc 
ggtcaccaggtgtggcagggcctcgtggcctacaacacaggtataggtctctccggagtt 
ccattcctcctctgtcacagtcaggatgctgtgggtaaagtagaagcctggggccccagg 
ctctggcatcggggcactggtcacatacttctcttggggcaagagttgccctccctgaag 
15 • ccactgcacactgatgtctgcaggagagaagcccttcaccaggcaggtgactgtggctga 
ctccctcaggttcagttgctcacgagctggtggcagcaggtacctcggccgcgaccacgc 

taagggcgaattccagcacactg 



20 • 

Table 10: Sequence of Clone A39 (SEQ ID N0:11)/ related to 
an immunoglobulin sequence 

acaagatggggactgggacgggcctttgactactggggccaaggcaccactgtcacagtcr 
25 tcctcagccaaaacaacacccccatcagtctatccactggcccctgggtgtggagataca' 
actggttcctctgtgactctgggatgcctggtcaagggctacttccctgagtcagtgact - 
gtgacttggaactctggatccctgtccakcagtgtgcacaccttcccagctctcctgcag 
tctggactctacactatgagcagctcagtgactgtcccctccagcacctggccaagtcag 
accgtcacctgcagcgttgctcacccagccagcagcaccacggtggacaaaaaacttgag 
30 cccagcgggcccatttcaacaatcaacccctgccctccatgcaaggagtgtcacaaatgc 
ccagctcctaacctcgagggtggaccatccgtctttatcttccctccaaatatcaaggat 

gt 

35 • * . 

Table 11: Sequence of Clone A48 (SEQ ID NO:12), related to 

. an immunoglobulin sequence 

40 . ^ ^ 

acaccattccacctcccaaggagcagatggccaaggataaagtcagtccgacctgcatga 

taacaaacttcttccctgaagacattactgtggagtggtagtggaatgggcagccagcgg 

agaactacaagaacactcagcccgtcacggacacagatggctcttacttcgtctacagca 

agctcaatgtgcagaagagcaactgggaggcaggaaatactttcacctgctctgtgttac 

45 atgagggcctgcacaaccaccatactgagaagagcctctcccactctcctggtaaatgat 

cccagtgtccttggagccctctggtcctacaggactctgacacctacctccacccctccc 

tgtgtaaataaagcacccagcactgccttgggaccctgcaaaaaaaaaaaaaaaaaaaaa 

aaaaaaaagt 

50 

Table 12: Sequence of Clone B45 (SEQ ID NO:13), related to 
kidney androgen related protein 

55 

acgcggggaatttctttctcctgttcactggttgccttaaccctactaaagcatgatgct 
tttcaaggtcctggtgatcactgtcttctgtggtctgactgtggctttccccctgtcaga 
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attagtttcaatcaataaagaactacagaattcaatcattgacctactaaactcagtctt 
tgaccaactgggatcatacagagggacaaaagctcctctagaggattatacagatgatga 
tttaagcactgactctgagcagatcatggacttcacgccagccgcaaacaaacagaattc 
tgagttctctactgatgttgagacagtctcctccggctttctggaagaattcactgagaa 
cacagacatcacagtgaaaattccattagctgggaatccagtctcccctacttcctgaga 
agactccaactttgaaataaacctgcgactgccattcaggatattgtctttgtagaaaat 
gatataaatttaagatgctcccagcaaagaagaaagaacgaggaagtgatcctggatgag 
aaatgctgtgccaggcatctttaagctctgcccttactctgtgtgggtttaagaaataaa 

cgttcatttg 



tTaJble 13: Sequence of Clone C22 (SEQ ID NO:14), related to 
disabled -2 p96 (Dab2) 

actgtgtaggtttccaaggctatgaggaatatatttcaggcagatttttcaatgataacc 
ccaaatatttgtccatgattgtggcttcaagaanttttccagttacttgataaaggccta 
gtcatccagtaaggacccagtgtgtctagcatagtcacatttagctttcagaatacagtc 
cttcactgctgagcagaatccagttcccctactgtgttaaattgctctatgaaacttcag 
tctaagtcccccaagccatagggagaatgaagaaggtgcaccttccagaagtctagcccc 
tggactaacaatgactggagtgggtgcgtgcacttcagatctatgaaacatttaatgctt 
tttgccccgtgaaaaatccattcctggtaaaagggaggcactgtttatataaacccagga 

gcaaaatgaactg 



TaJbie 14: Sequence of Clone E39^ related to an 
Immunoglobulin sequence 

14A: beginning of coding strand (SEQ ID NO: 15) 

acgcgggggagccacacaaactcagggaaagctcgaatatggttttcaaacctcagatac 

ttggacttatgctttttaggatttcagcctccagaggtgatattggctacctcaatctcc 

agccaccctgtctgtgcQtccaggagatagcgtcagtctttccttcagggccagccaaaa 

tattagcaacaacctacactgggttcaaaaaaaccacatgagtctccaaggcttctcatt 

aaagtcacacctggcctcatcacgttcggagctgggaccaagctggagctgaaadggact 

gatgctgcaccaactgtatccaatcttcccaccatccagtgaacagttaacatctggagg 

tgcctcagtcgtgtgcttcttgaacaacttctaccccaaagacatcaantgtcaaagtgg 

gaagwtttgatgggcagatgaaaccgaacaaaaatggcgtccctggaacagtttgggaac 

tgatcagggacaggcaaagaacagcacctacagcatggaagcagcaacccttcaacgttt 

gaccaaggaccgaagttattgaaaccgaccataacagctt 

14B: end of coding strand, obtained by sequencing opposite 
strand, deducing complement, and arranging in 5 '-3' order 
(SEQ ID NO: 16) 

nngnnccctcacgttgaccaagggnngagtatgaaggacataacagctatacctgtgagg 
ccactcgcaaggcatcaacttcacccattgtcaagaggttcaacaggaatgagtgttaga 
gacaaaggtcctgagacgccaccaccagctccccagccccatcctattttcccttttaag 
gtcttggaggttccccccaagcgacctaccactgttggggtgctccaaacctcctcccca 
cctccttttcctcctcctccctttccttggcttttatcatgctaatatttgcagaaaata 
ttccaataaagtgagtctttgcccttaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 



Table 15: Sequence of clone F27 (SEQ ID NO: 17^ related to a 
105 kDa heat shock protein 
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acaaattggatgcaaaatccaaaattcgagccctccttcgtctccatcaggagtgtgaaa 
agttgaaaaagctcatgagttctaacagcacggacctgccgctgaacatcgagtgcttta 
tgaatgacaaggatgtctctgggangatgaacaggtcacagtttgaagaactgtgtgctg 
agctcctgcaaaaaatagaggtcccccttcactcgttgatggcacagactcagctcaagg 
5 ctgaanatgtgagtgccattganatagtgggaggtgcccaagaatcccanctgtnaaaga 
aagaattgccaanttctttggaaaagatgtnancacccacnctnaatgntgacnaagctg 
taggccngfiaggctgangcactgnctngtncaatnctttctccggcattttaaagttaca 
gantcntnttttcncnccatgctacccnttntccnaatatcanctggtcttggaacccnn 
tgnctcnnnangaaaacnttaacnntgttcnccnaccttntntcatnncnggaaccntgn 
10 ncgntnnttttnttccanacgtgtcnnnncctttnctnngcanaggggnggccctttngt 
nctacnaacnnttttcttcttnaccncncnanggnantncnngtnnntaaancaanacna 
nnnnnccnnatannactgccnnaacntttntaccntcnanccgntnnttttcccnnncnc 
tanngtnanancctcnaannnncntngtntnttcatcntncnncnccntnncttgcnnnn 
angnccnnncacnnccannnnactnctcttcctannctcn 



Table 16: Sequence of clone G38 (SEQ ID NO: 18, related to an 
. immunoglobulin protein 
. .. 20 . 

acgcggggnattgtcattgcagtcaggnactcagcatggnaqatgagggctcctgcacag 
natttttggcttcttgttgctcttg.tttccaggtaccagatgtgacatccagatgaccca 
gtctccatcctccttatctgcctctctgggagaaagagtcagtctcacttgtcgggcaag • 
tcaggatattggtagtagcttaaactggcttcagcaggaAccagatggaactattaaacg 
cctgatctacgccacatccagtttagattctggtgtccccaagaggttcagtggcagtag 
gtctgggtcagattattctctcaccatcagcagccttgagtctgaagattttgtagacta 
ttactgtctacaatatgctacttctcctccgacgttcggtgggggcaccaagctggaaat 
caaacgggctgatgctgcaccaactgtatccatcttcccaccatccagtgagcagttaac 
atctggaggtgcctcagtcgtgtgcttcttgaacaacttctaccccaaagacatcaatgt 
caagtggaagattgatggcagtgaacgacaaaatgggcgtcctgaacagttggactgatc . 
aggacagcaaaanacngcacctacagcatgancagcacccttacgttgaccaaggacnag 
tttgaacgacntancnctttacctgngaggccctnacaaaaatnacttncccattgnnag 
acttcacaggaatgagtgtanaaacaaggnctgnacnccccccnntnccanttcntctat 
ttcctttaaggttggaggttccccannacnccnttntgggggctcaaactntnccnctct 

tt 



Table 17 : Sequence of clone B3 

40 

17A: beginning of coding strand (SEQ ID NO: 19) 
acttgaactggccttcttatccaggataagtggagttcatgcaaggtggatccctcatga 
taggattagtggctttataagaacagtgattgagaaagaattcattctgttgacgcccat 
gcaagcggaacagcaaactgcttaagaactcatggaatgtggcccactggttagcaccat 
• 45 ctcaatgggtgcgccttggagaggcactacatgtcctgaggatcacatgctgctgtagag 
cgtcactctgcttgttacccttgtggtggtcatcatgtctccccctaatctatgagctng 
tatgaaaactctaagagggtttccagaccctcacagggcagtgtcattggcatttacaat 
agttgttcttgatctttctcaagcattactgctggggttagattaatgaattcagtttac 
cattcaantaaagttagggttaaggtatttttgctaagtnggtttntgatcgtaccaaaa 
50 • tcttggaggagcaatggtcaaagtttttagaaaggccntttangaaaagttttgataacc 
cnantggaaaggattnttttggangntcnagggnaancaaagnanttaaccaccaagttn 

cnt 

17B: end of coding strand/ obtained by sequencing opposite 
55 strand, deducing complement, and arranging in 5 '-3' order 
(SEQ ID NO: 20) 

ctttttaagataaattgaaacctgccagtaagacaacctggggagaaaattgccttgtta 
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agttataacaagttctttggttatgaatgcatgtgggttcttgggaataatttgttttta 
acccataatatgtaaaacatttgatcatgtgaagctattgaggagcatatgtatctctct 
tttttaaaaatgtattctcattgtaattattcacttaaagaaaagtagaatgtaaccaca 
tcgtggttttatgctgcttgaaagaatttcctggtagatatataagctatggaagaaaaa 
5 taaaaattttggagcttgcatcatccgaaaaaaaaaaaaaaaaaaaaaaaaaagt 

Table 18: Sequence of clone F39 

18A beginning of coding strand (SEQ ID NO: 21) : 
10 acgcgggggagccacacaaactcagggaaagctcgaatatggttttcaaacctcagatac 
ttggacttatgctttttaggatttcagcctccagaggtgatattggctacctcaatctcc 
agccaccctgtctgtgcctccaggagatagcgtcagtctttccttcagggccagccaaaa 
tattagcaacaacctacactgggttcaaaaaaaccacatgagtctccaaggcttctcatt 
aaagtcacacctggcctcatcacgttcggagctgggaccaagctggagctgaaacggact 
15 gatgctgcaccaactgtatccaatcttcccaccatccagtgaacagttaacatctggagg 
tgcctcagtcgtgtgcttcttgaacaacttctaccccaaagacatcaantgtcaaagtgg 
gaagwtttgatgggcagatgaaaccgaacaaaaatggcgtccctggaacagtttgggaac 
tgatcagggacaggcaaagaacagcacctacagcatggaagcagcaacacttcaacgttt ♦ 
gaccaaggaccgaagttattgaaaccgaccataacagctt 

20 . ^ 

18B END OF CODING STRAND^ obtained by sequencing opposite strand, 
deducing complement, and- arranging in 5 '-3' order (SEQ ID 
.NO:22) : 

nngnnccctcacgttgaccaagggnngagtatgaaggacataacagctatacctgtgagg 
25 .ccactcgcaaggcatcaacttcacccattgtcaagaggttcaacaggaatga^tgttaga 
gacaaaggtcctgagacgccaccaccagctccccagccccatcctattttcccttttaag 
gtcttggaggttccccccaagcgacctaccactgttggggtgctccaaacctcctcccca 
cctccttttcctcctcctccctttccttggcttttatcatgctaatatttgcagaaaata 
ttccaataaagtgagtctttgcccttaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

30 

Table 19: Sequence of F2 (SEQ ID NO: 23) 

acctccccaaaatgactcttttggagcacagttacctccaatgcatcagcagcaaagcag 
atctgtaatgacagaagaatacaaagttccagatggaatggttggatttataattggcag 
aggaggtnaacagatctcacngngngcagcaggaatctggatgcangatacanatagcnc 

35 . -ctgatagtggtggcctaccanaaaggtctngtatgctaactggaacacctgaatctgtcc 
agtcagcaaaaagattattggaccagattgttgaaaagggaagaccagcccctggccttc 
atcatggngatggacctggaaatgcagttcaggaaatcatgattccanccagcnaagcna 
ggactagnttattcggaaaggggggccgagactattaatcnnctttcncgaacggcctgg 
gtgttaaaatggttatgattnaagatnggcctcaaaaaccntgggggtgatnaacctctt 

40 agggattnggggttnnccatacaaagttnagcaagccaagcnnnangtattaggagttaa 
ttngtgaanaaggtggtttcagagaagtgcggaatgagtatggctcaanaataggaggca 
atgaagggatagatgtcccaatttccaagatttgctgttggcattgtaataggaagttnt 
ttnaaaaanaacnnccaaaaaaaaaaa 

• 45 

Table 20: Sequence of F21 

20A beginning of coding strand (SEQ ID NO: 24) : 

ttnngnantacngngcncaacccnttttaggaacggnccccantgtgctggaattcgccc 

ttagcttggtcgcggccgaggnnctacggctcaaaaccccgtcaccacggcccggatcgg 

50 gaagtgtcaagtgggagccgggttcccccgcatttgcagccgccacanctgaacccggac 
ccctcaccccagccgggcnctaccgccntttttaacntgtnctnnngngggngggatcct 
ggagttngntttgataaaaaaanccccccctgtgccngngggnagnaccnncnntntncc 
tccaaaccnngnctgctnccaataaaananaggncntcgtggnnttntatacatacaaaa 
ncnccctnatnatcngnaacnacnaccnnnntncctnnctngttcnantancccncancn 

55 gacaaatcnccctntncacatncccccnccnaacnnnctacaantcnacaaacccctnnc 
nacccccattnatcttagcccncccntaggncnancntacaccnanccaatcnngcacan 
cnaanacccngccnccccctccnctacnaacctccnnacctntcatacnttccnnccgtc 
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ncccnntnatcaacnnctccnntacnncnctcccccccccnannngttcccccaaancnn 
nccactaancacccgnacctttnnnncccntnaanagnccatcccctaantncccacccc 
ctttcnctcnntnactntntnnnnncccccncacacancantntacnnntnttctntnca 
ccacaaccatttccccnantttnacntccaatctcantcctacccccnncnacacccnca 
cctcacttcaaccatanaccnnncncnttcgtccccancncaccncannncccctnncnc 
acncactcttcnnntcnntncnnnttcacccantaccccntcgcaacgntcccncnantn 

ccannntcccncacn 



20B end of coding strand, obtained .by sequencing opposite 
strand, deducing complement, and arranging in 5 '-3' 
order.(SEQ ID NO: 25) : 

gaagngnaaaattttttttaaaaaaaaaaaaaaaaaaaaaaa . 
Table 21: Sequence- of F38 

21A beginning of coding strand (SEQ ID NO: 26) : 
gntctgtgaaanganatagannggctcgaaggaccaaggcancttcctgactgaagacca 

gatcctgccgctgttgctgggtatcagcagaggccttgaggctattcatgccaaaggtta 

tgcacacagggacctgaagcccgtngnnnattttgcttggtgatgaggggcanccaantn 

taatggacatgggttntnttaatcanttcatgcattcaagtggagggctctcgccaggca 

ctaactcttcaggactgggcagctcagcggtgcaccatctcctaccgggcacctgaactt 

ttttctgtgcaaagcctn 

21B end of coding strand, obtained by sequencing opposite 
strand, deducing complement, and arranging in 5 '-3' 
order(SEQ ID NO:27): 

ctagnggcctttgtttttnntnaggtttcccggggggtggcnccctttttntttntnggg 
ccccccttttttgnantggntntttttttncnncgccctctgggggggnntcncgcggcg 
cntttttttgttgngggggntttgggggngncnntttcttcgtgggnccgntnttttttt 
tntncnaaaccccnngnttcnttnttgggggctntttgtgnnncgnngggggnngntttt 
tcccttntttttngggggnggccccncnntattgnngtttnccaanngggggcganngng 
cccntcttgggccaaattnagngngcntcccccncannccccccggctttttttancntt 
gtgnanntttttntttttttnttgtgggggggggccccccgcgggggnctncntnccctn 
tncnncncngthaggntgggggnntttccccccccngnttnnggccgnncccccccccnn 
tttnnntnnannnngngggcnttttggggnangncccntnnaggggntttttncccccnt 
tgggncncnnnccntttttnccgggnggntttnccngntnnnggcanggnnagggggncc 
ntnttttttttccctttttccccccantcnngggnnngggccctngggggggnngggggg 
aantttanccnnnantttgggncnaanntngggnngggggggcccccnncnttttntttc 
ccntntngggggnnaaaaaaancntnaaaaggnnaatnangttaaacccncccaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

aaaaaaaaaaaacaaaaaaa 

Table 22: Sequence of F40 (SEQ ID NO: 28) 

acgcggggtttccggcttgaggatccgtcgcgtttctgaggcggcgcctcagtcggcagt 

taggatcggctcggagcggtgggtctcggccccggcgtcaccatgggggccgtcctcggc 

gtcttctccctcgccagctgggtcccgtgcctctgtagtggtgcatcatgtctgctgtgc 

agttgctgtcccatcagtaagaattccactgtaactcgggtcatctacgcttttatcctc 

ttccttggcactattgtgtcttgcatcatgatgacagaaggcatacaaactcaactgaag 

aagattcctggattctgtgaaggaggatttcaaatcaagatggttgatacaaaggcagag 

aaagattgtgacgtgctggtcggttttaaagctgtgtatcggatcaactttgctgtggcc 

atttttttctttgccttctttttgctcatgttaaaagttaaaacaagtaaagatcccaga 

gcagcagtgcacaacgggttttgggttcttcaaaatcgctgccattatggtatcatgatt 

ggatctttctacatccctggggggagttttactgaagtctggttttttgctggaatgttg 

ggggcctctttcttcattatcatccagctggtgctcttggtagacatggctcactcttgg 

aatgaattatgggtaaatcgaatggaggaaggaaacccaaggctctggtatgctgccttg 

ctgtcctttacaagcctcttttacatcctctccatcgtctttgccgcgctgctctacgtc 
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ttttacaccaagcctaacgactgcacagaaaacaaggtcttcatcagcctcaacctgatt 
ttttgtgttgcagtttcatattgtgtccatcctccgtaaagttcaggaacatcagcctcg 
ctctggcctcctgcagtcctccatcattactaaga 

5. 

TABLE 23: SEQUENCE OF G9 (SEQ ID NO: 29) 

acgcggggatctaggaggctgctgacctccaacaggaatttcaccacttaaccctctaga 
agtcccactacttaatacttcagtacttctagcatcaggtgtttcaattacatg^gctca 
tcatagccttatagaaggtaaacgaaaccacataaatcaagccctactaattaccattat 
10 actaggactttacttcaccatcctccaagcttcagaatactttgaaacatcattctccat 
ttcagatggtatctatggttctacattcttcatggctactggattccatggactccatgt 
aattattggatcaacattccttattgtttgcctactacgacaactaaaat.ttcacttcac 
atcaaaacatcacttcggatttgaagccgcagcatgatactgacattttgtagacgtaat . 
cttgetcttttcctatacgtctccatttattgatgaggatcttaaaaaaaaaaaaaaaaaa • 
. 15 aaaaaaaaaaaaaaaaaaaaagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

TABLE 24: SEQUENCE OF G16 (SEQ ID NO:; 30) 
• • .acgcggggnagttgcggccctgtgcgcttccacagcgtcccagcctttgtcctggcct-ca 
gctcctgcttcctgacctgcttgagttccagtcctgacttcctttggcgataaacagcag 

20 tatggaaatacctcacagtggatgttcctggccatggaaaggtgctggtggacattgcat 
atggtggggcattttatgcgtttgttagtgcagaaaaattaggacttgatgtgtgttctg 
cgaagacgagggaccttgtggatgcggcaagtgcattgacaggagcagtgaaagcacagt 
ttaaaatcaaccatcctgagagtgaagaccttggttttttgtatggatccatcttgacgg 
atggaaaagatgcttatagtgaggaggccaccaccaacatctgcgtgtttgccgatgaac • 

25 aggttgacagaagccccaccggctcgggagtgacagccagaattgctctgcagtatcata 
aggggctcctgcagctgaaccagaccagagccttcaaaagcagcgcaactggctcggtgt 
tcacaggctgtgccgtgagggaagcaaagtgtggagatttcaaagctgtcatagtggaag 
.: ttgcaggacaagcccacttacncagggrcagcaaacttgmcagtggaagatggtgacccc 
ctaagggatggcttttttttcaagkgacttttttttcataagttttaaaggntttgctta 

30 naaagaaatwccycttaaatggggntttAtctgnattatgggcnaaccngtntccngtta 
cccttaccgggtaatttttanactttttaaaaagtatgtttagtanactgngtttaaatg 
taaggnnttatacttcctntttgtcacccntttgngcataaatgtngntaatggggcnaa 
canagngccaanaatccggaaacttttnaagccttngtntncaaggcnaaattnaagccc 
ngttncttaatttntgctgnggaaatgataaaacttgtaagttttttttgattgntagnt 

35 tcagctntnanccccccnatagtaaatgttgngngtttgttnttttaatgttccngtttg 
ngtaaaataaaggcntcntgtaagnttccaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaancaaaaaaaaaaaaaaaaaaaaaaaaa 

TABLE 25: SEQUENCE OF G24 (SEQ ID NO: 31) 

40 acgcggggagactggggaaa-tcgaaattatttcagcagctgtcttcagtcactgggtgtg 
tcttcctgcacagtggacttgacgggcctggagggaggagaagaaattgaaacctatctc 
atcctctagtttatttacatcattggattcggttcctgtttggaaacttaactggtcctc 
atggcgttttcgggcacagggacactggtggcttccattgtctccaagatgacatcttca 
gcagccatggtcaaagctggaggggcagtctccggcaccattctagctggttcgaagggg 

45 ctcatcctgttaacacagagtgccctgggctctgcaacatctgcacttggagcaccgaag 
gttggcaccattttatctgggttctctgcctccaccttggctgcttcccccattggagcc 
aaggctgttgtggctgtgcttgggggagccatgacagtagctgctgtaccacctgcgctg 
agtgctgtgggcttcaccgcctcaggaattgcagcctcctctctagcagctaagatgatg 
tccttgtcagctattgctaatgggggtggagtcccagctggtggcctggtggccattttg 

50 cagtctgctggagctgctggactttccgtgccatctactgtcatcgtgggctttgcaggg 
tttgccgttgtggccagtgtcatgaacatctgtgaaagtttttacccttttttcatggga 
tcagaggtggcagacatggcgacaaaggtggcagatatggcgataaaggtggcagacatt 
tcaacaaaaaaggcactgcccaacttgtttaacccagagaaggattaattacccccaggc 
gacaatttcctaacacatattgcctaggtggaaataaattaaaccgagtttttaaaaaaa 

55 aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

TABLE 26: SEQUENCE OF G28 (SEQ ID" NO: 32) 
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accaccggnaccacagtncagcaacgtcaagnatggacatcctgtcttacatgnaggccg 

gcaggcagccctgagcgccagggggaagtaactgatgcccccaccctacccctaccccct 

gcccatcatgcaagagccgaggaggggctcttcaggaaggaagccacattcccagtcatt 

ctacccccaccccagattctctttcttattacataaaagacaagcctggcacaactgtgt 

gtctgaaccactgtggacacgtgacaattgtcccagtgtgtgcatggctacacagccacg 

tatctgcctgcttgaaacccagggatggtccatctgtgttt.acggcttggcacaacaccc 

tcatatttttttcagctttctgttccaaatgagcccaaaggaaacacaagttctaggtcc 

aatggttctgctcaaacctgaacatcatitcttggggccagcatctcccacacgcceacac 

tacacaccaccagcctccttcttccttcctgaaggaccctcctgagccccoaagcccatc 

ccacagtgctcctgagaccagccaagacaactgtgagcgcgatggccgtgtaccttgccc 

gggcggccgtcgaaagggcgaattccagccacttggcggccgtactagtggatccgagct 

cggtccaacttgatgcatacttggantnttntatagtggcacctaaatagcttggngtaa 

tntgggnatagctgnttctgngngaaatggtnttccgttanantccnacaaonacaaccc . 

griacataagtgtaaanctggggngcnangangncnnactanttantnnntngnctnntgc 

cnttcannngnaactnnnncnntnntnnantgcaccccggnnggngntnn 

TABLE 27: SEQUENCE OF B46.(SEQ ID NO:33) 

accacggggacaggatttgggacgttcctgatgaccgtgttgctgaaatacctgtgcgca 
gaatatggctggcggaatgctatgttcatccaaggtgctctgtccctgaacctgtgtgtc 
tgtggggcgctcatgaggcccctctctcccgagaagttagaaaactgcccagaagcggaa. 
gagccgtgtgctctcccagcttactccactgaatctgttaagtctgggggaccactgggc 
atggctgaagaacaggacagaaggcccgggaatgaggagatggtgtgtgaccttcaaaca 
caggagtgtcaaggtcaaacccatccgaggaagaatgtgtgtgccttccgggttctgaag 
acagtgagccagctcactgtgcaagtccggaggggctttagggactggcactcaggctat 
tttgggacggcttcactcttcaccaaccgcatgtttgtagcctttattttctgggctctg 
• ttcgcatacagcagctttgtcatccctttcatccacctgcccgaaatcgtcagtttgtat 
aacttgtcngaaccaaaatgacacattccctcttgacctcgattatagcaatacttcaca 
tcttcgggaaggtgatctttgggggccgtggnccacctcccttgtatcancgtctggaat 
ggtcttcctnatnngttacttnaccctcgtcctnnnatnttnctttttgcctttgatgcn 
cacctccccaanntggntnnctttgggccctaatgggntttctagcggttttntcctnnn 
cctggggnacnangactgntngcntnnaccttngcnatgctacgnntntattnnccangcr 
ntttnncttntngancccntnnngaggatttncnccca 

TABLE 28: SEQUENCE OF F16 (SEQ ID NO: 34) 

acttgctggtggtccatccctgcactatttccaagccaggtcttgggatcagtgacttgc 
ttcctgcagtgctgtggtcacaggcccctccagtcagccatgtctagcagcttggcaggc 
ttaggggctcggtgtgcagatttctttttgtgcagcagaatgccaagagactgagcagag 
agtcttcttaaaacctaccctaagggccgccaggtccatgtgcccgtggtaggtctgggg 
ccaggcagctgctgctgtcctgatcccccctcccttagctatggatccttgtagtagagg 
tcagccgctcctggcgcaaacaggatccagtagctgtctgccacccacgafifgtttctaag 
ataccatgtggagacagggtcctccccatccttttcagaaatagcagcttgttttctctg 
ctggcatgtgtccagggcaGcaggtcccagcagggggaggcagactgttaactgaacscg 
cataggaacaggaggcagcgagagcctgtgggaggaagggccttctagtgaccctgcttc 
tccatcctgcctccctcctgtccacccagtttcagagttgggaggggaggagccatggct 
ccctcacccaggcttggtggctgtggccctgttttagagccatggcagcacaggctggtg 

agtggtagagttgt 

In the above Tables, "^N" denotes "unknown". 
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Commen-bs on Clones and Cogna-be Database Sequences 

In these comment, "Full-length" means that it includes 
complete coding sequence 

B3 clone is not full-length. It is similar to a DNA • 
sequence in the database but does not match any protein in 
the database (no protein is given for the DNA sequence) . 

B45 clone is full-length as are the mouse and human 
homologs. B45 is • an exact match to the database proteins 
whereas P4 and P6 have a one amino acid difference. 

B46 clone is possibly full-length. 

BAB22782 hypothetical protein ~ putative [Mus 
musculus] is full-length. 

XP__087215 similar to unnamed protein product [Homo 
sapiens] is full-length. 

B2ffi70909 unnamed protein product [Homo sapiens] ^is 
full-length. 

C22 Clone is not full-length. Similarity is to 3' non- 
coding region of AF260579S2 Mus musculus disabled-2 p96 
(Dab2) gene, exons 2-15 that encodes protein AAG44669 
disabled-2 p96 [Mus musculus] (full-length) . This protein 
is similar to AAH03064 disabled (Drosophila) homolog 2 
(mitogen-responsive phosphoprotein) [Homo sapiens], P98082 
Disabled homolog 2 (Differentially expressed protein 2) 
(DOC-2) [Homo sapiens], and A2^23161. disabled-2 [Homo 
sapiens] (all full-length) . 

PI clone is not full-length. 

AAF23120 SON protein [Mus musculus] is full-length. 
NP_003094 SON DNA binding protein; SON DNA-binding 
protein; SON DNA-binding protein, KIAA1019; NRE-binding 
protein [Homo sapiens] are full-length. 
AAK07692 NREBP (negative regulatory element-binding 
protein) [Homo sapiens] is full-length. 
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F2 clone is not full-length. 

IIP_476513 DNA segment, Chr 3, ERATO Dei 330, 
expressed; far upstream element (FUSE) binding protein 
1 [Mus musculus] is full-length. 

AAH14763 Similar to far upstream element (FUSE) 
binding protein 1 [Mus musculus] is full-length. 
NP 003893 far upstream element-binding protein; far 
upstream element binding protein; FUSE-binding protein . 
[Homo sapiens] is full-length. 

AAA17976 FUSE binding protein [Homo sapiens] is full- 
length. 

A53184 myc far upstream element -binding protein - • 
human - is full-length. 

AM17247 far upstream element (FUSE) binding protein 1* 
[Homo sapiens] is full-length.* 

P4 and F6 clones are full-length. 

NP 034724 Icidney androgen regulated protein [Mus 
musculus] is full-length. 

Several other mouse matches are full-length, 
AAH08576 Icidney androgen regulated protein [Homo 
sapiens] is full-length. 

A2^650272 FKSG22 [Homo sapiens] is full-length. 

FS clone is possibly full-length. 

NP_076367 claudin 10; claudin-10 [Mus musculus] is 
full-length. 

BAB32005 and XP_127876 putative - similar to CLAUDIN- 
10 [Mus musculus] are full-length. 

BAB71030 unnamed protein product [Homo sapiens] is 
full-length . 

Note 1: Although, the htiman protein is ^^unnamed'S it is 
very similar to the mouse Claudin-10 listed above. 
Note 2: The first 73 amino acids of these Claudin-10 
proteins (including Clone F5) differ significantly from 
other proteins that are also named Claudin-10- 
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P16 clone encodes an unknown protein. The only significant 
similarity is to a genomic nucleotide sequence of Mus 
musculus. It is not known if P16 is full-length. 

F21 clone encodes an unknown protein. A small portion of 
nucleotide sequence is similar to a mouse clone encoding an 
unknown protein. It is not known if -Fai is full-length. 

F27 clone is probably not full-length. Only a portion of 
its protein sequence is similar to Mus musculus and Homo, 
sapiens Heat shock 105 kD protein (both of which are full- 
length) . The rest of the P27 sequence is novel. 

P38 clone is possibly full-length. 

A2UD02811 palmitylated serine/threonine kinase [Mus 
musculus] is probably full-length. 

BAA89662 F5-2 [Mus musculus] is probably full-length.. 
NP 035624 and others serine/threonine kinase 16 [Mus 
musculus] is probably full-length. 

NP 003682 and others serine/threonine kinase 16; 
protein expressed in day 12 fetal liver [Homo sapiens] 
is full-length. 

CAA06700 PKL12 protein [Homo sapiens] is full-length. 

F39 clone is probably not full-length. Only a portion of 
its protein sequence is similar to Mus musculus' and Homo 
sapiens Phosphotriesterase Related Protein (both of which 
are full-length). The rest of the F39 sequence is novel. 

*F40 clone is not full-length. 

AAH11295 Similar to tumor differentially expressed 1 
[Mus musculus] is full-length. 

AAH22901 Unknown [Mus musculus] is full-length. 
NP_036162 tumor differentially expressed 1 [Mus 
musculus] is full-length. 

AAD54420 membrane protein TMS-1 [Mus musculus] is 
full-length. 
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jMffi48858 Diff33 gene product [Homo sapiens] is full- 
length. 

AAD34641 transmembrane protein SBBI99 [Homo sapiens] 
is full-length. 

NP_006802 tumor differentially expressed 1; placental 
transmembane protein [Homo sapiens] is full-length. 

G9 clone is not full-length. 

CAA24Q90 cytochrome oxidase III (Mus musculus] is 
full-length. 

AAK17824 a^d AAIi54598 cytochrome c oxidase subunit 
III [Homo sapiens] are full-length. 

616 clone is not full-length. 

BAB31217 evidence :NAS-hypothetical proteins-putative 
[Mus musculus] is full-length. 

AAB04753 similar to RIKEN cDNA 2810055F11 gene [Mus 
musculus] is full-length. 

AAH12131 similar to RIKEN cDNA 2810055F11 gene [Homo 
sapiens] is full-length. 

624 clone is full-length. 

BAB22924 homolog to ERG2 protein putative- [Mus 
musculus] is full-length. 

AAH22800 Unknown [Homo sapiens] is not full-length. 
NP 114425 and AA635730 TLH2 9 protein precursor [Homo 
sapiens] appear to be full-length. 

626 clone is not full-length. 

NP_032320 and others hydroxysteroid dehydrogenase-4 ^ 
delta<5>-3-beta; 3-beta-hydroxysteroid 

dehydrogenase/delta-5-delta-4 isomerase [Mus musculus] 
are full-length. 

NP_000853 hydroxy-delta-5-steroid dehydrogenase, 3 
beta- and steroid delta-isomerase 1; Hydroxy-delta-5- 
steroid dehydrogenase , 3 beta- and steroid [Homo 
sapiens] is full-length. 
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628 clone is probably not full-length. 

BAB21943 and BAB23686 glutathione peroxidase 3 ~ 
putative [Mus musculus] are probably not full-length. 
XP 087620 siiflilar to glutathione peroxidase 3 (plasma) 
[Homo sapiens] may be full-length. 

HI clone is not full-length. Similarity is to 3' non-coding 
region of U96635 Mus musculus ubiquitin protein ligase 
Nedd-4 mRNA that encodes protein AAB63360 NEDD-4 .[Mus 
musculus] (full-length) . This protein is similar to P46934 
NEDD-4 protein [Homo sapiens] and,.BAA07655 KIAA0093 gene 
product is related to NEDD-4 protein [Homo sapiens] (full- 
length) . 

. • H8 clone is nearly full-length. 

NP 038505" ATPase, H+ transporting, lysosomal 38kDa, VO 
subunit D isoform 1; ATPase ,H+ transporting, lysosomal 
(vacuolar proton pump) , 42 kDa [Mus musculus] is full- 
length . 

XP 088368 similar to Ac39/physophilin [Homo sapiens] 
is full-length. 
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CLAIMS 

1. A method of protecting a human subject from kidney 
damage which comprises administering to the subject a 
protective amount of an agent which is 

(1) a polypeptide which is substantially structurally 
identical or . conservatively identical in s.eqvience to a 
reference protein selected from the group consisting of 

.(androgen related protein) NP_034724 and P15267; 

(SON protein) NP_064357, AAF23121, NP_003094, and AAK07692; 

(FOSE binding protein 1) NP_476513, NP_003893, and AAAn976 

(claudin 10) BAB32005, NP_067361, NP_076367, XP_127876, 
NP_008915, and BAB710301; 

(heat shock protein) BAA11035 and XP_036357; 

(phosphotriesterase related protein) NP_032987 and 
NP_109589; 

(vibiquitin protein ligase Nedd-4) AAB63360 , P46934, and 
BAA07 655; and 

(Ac39/physophilin) NP_0385 and XP_08836; 
or 

(2) an expression vector encoding the polypeptide of (1) 
above and expressible in a human cell, under conditions 
conducive to expression of the polypeptide of (1); 

where said agent protects said subject from kidney damage. 
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2. A method of protecting a human subject from kidney damage 
which comprises administering to the subject a protective 
amount of an agent which is 

(1) an antagonist of a polypeptide, occurring in said 
subject, which is substantially structurally identical- or 
conservatively identical in sequence to a reference protein 
selected from .the group consisting of 

(disabled-2 p96) AAG44669., AAH03064, P98082, and 7^F23161; 

(palmitylated serine/threonine kinase) AAD02811, BAA89662, 
NP_035624, NP_003682, and CAA06706; 

(tumor differentially expressed 1 protein) AAH11295, 
AAH22901, NO_036162, AAD54420, AAD22448^ AAB48858, AAD54-*420, 
NP_006802, AAB48858, and AAD34641; • 

(cytochrome oxidase III) CAA24090, AAK17824, and AAL54598; 

(TLH39 protein precursor) BAB22924, AAH22800, NP_114425, and 
AAG335730; 

(hydroxysteroid dehydrogenase 4 delta <5>-3 beta) NP_032320, 
NP_000853, and AAA51831; . and 

(glutathione peroxidase III)BAB21943, BAB23686, and 
XP_087620; or 

(2) an anti-sense vector which inhibits expression of said 
polypeptide in said subject, 

where said agent protects said subject from kidney damage 
mediated by said polypeptide. 

3. A method of protecting a human subject from kidney 
damage which comprises administering to the subject a 
protective amount of an agent which 
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(1) 

(a) down-regulates expression of an ^^unfavorable" 
protein which is identifiable as a hoinologue in said sub-ject 
of a mouse marker gene which is 

(i) up-regulated in a first group of mice, which 
are experiencing or are prone to kidney damage, and/or 

(ii) down-regulated in a second group of mice, 
which are protected against or otherwise less prone to 
kidney damage, relative tOr the other group, or 

(b) is an . antagonist for the expression product of the 
^^unfavorable'' gene, or 

(c) degrades that product, 

or 
(2.) 

(a) up-regulates expression of a ^^f avorable'' protein 
which is identifiable as a homologue in said subject of a 
mouse marker gene which is . • 

(i) down-regulated in said first group of mice 

and/or 

(ii) up-regulated in said second group of mice, 
relative to the other group or . • . 

(b) is an agonist for the favorable protein, or 

(c) inhibits the degradation of the favorable protein, 

or 

(d) is said favorable protein or a protein which is 
substantially or conservatively identical thereto*, or 
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(e) is an expression vector comprising a DNA sequence 
encoding said protein (d) and operably linked to a promoter 
whereby said protein (d) is expressed in cells of said 
subject which are transformed by said vector, 
where said agent protects said subject from kidney damage. 

4. A method of screening for human subjects who have 
developed, or are prone to development of, kidney damage, 
which comprises assaying tissue or body fluid samples from 
said subjects to determine the level of expression of a 

favorable" human marker gene, said human marker gene being 
substantially structurally identical or conservatively 
identical in sequence to a protein selected from the' group 
consisting of 

(androgen related protein) NP_034724 and P15267; 

(SON protein) NP_064357, AAF23121, NP_003094, and AAK07692; 

(FUSE binding protein 1) NP_476513, NP_003893, and AAA1797 6; 

(claudin 10) BAB32005, NP_067361, NP_0T6367, XP_127876, 
NP_008915, and BAB710301; 

(heat shock protein) BAA11035 and XP_036357; 

(phosphotriesterase related protein) NP_032987 and 
NP_10958 9; 

(ubiquitin protein ligase Nedd-4) AAB63360 , P4 6934, and 
BAA07655; and 

(Ac39/physophilin) NP_0385 and XP_08836; 

and directly correlating the level of expression of said 
marker gene with the development of kidney damage in said 
patient . 
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5. A method of screening for human subjects who have 
developed, or are prone to development of, kidney damage, 
which comprises assaying tissue or body fluid samples from 
said subjects to determine the level of expression of an 
^^unfavorable" human marker gene,, said human marker gene 
being substantially structurally identical or conservatively 
identical in sequence to a protein selected from the group 
consisting of 

(disabled-2 p96) AAG44669, AAH03064-, P98082,- and Z^F23.161; 

. (palmitylated serine/threonine kinase) AAD02811, BAA89662, 
NP_035624, NP_003682, and CAAD6700; . 

(tumor differentially expressed 1 protein) AAH11295, 
AAH22901, NO_036162, AAD54420, AAD22448, AAB48858, AAD54420, 
NP_006802, AAB48858, and AAD34641; 

(cytochrome oxidase III) CAA24090, AAK17824, and AAL54598; 

(TLH39 protein precursor) BAB22924, AAH22800, NP_114425, and 
AAG335730; 

(hydroxysteroid dehydrogenase 4 delta <5>-3 beta) NP_032320, 
NP_000853, and AAA51831; and 

(glutathione peroxidase III) BAB21943, BAB23686, and 
XP_087 620; 

and inversely correlating the level of expression of said 
marker gene with the development of kidney damage in said 
patient . 



6. A method of screening for human subjects who hav 
developed, or are prpne to development of, kidney damage, 
which comprises 
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assaying tissue or body fluid samples from said 
subjects to determine the level of expression of at least 
one human marker protein, where said human marker protein is 
identifiable as a homologue of a mouse marker gene which is 
expressed at different levels in a first group of mice who 
are experiencing or are prone to kidney damage and in ,a 
second group of mice protected against or otherwise less 
prone to kidney damage, and 

correlating said level of expression of said human 
marker-gene with the development of kidney damage In said-. 
subject. 

7 . The method of claims 4 or 6 in which the marker 
gene is one whose expression is down-regulated in the first 

group of mice . 

8. The method of claims 4 or 6 in which the marker 
gene is one whose expression is up-regulated in the second 
group of mice and/or down-regulated in the first group of 
mice . 

9. The method of claims 4 or 6 in which the first 
group of mice are treated with streptozotocin. 

- 10 . The method of claim 9 in which the second group of 
mice are treated with streptozotocin, but are genetically 
modified mice protected from diabetes-induced kidney damage 
by virtue of their genetic modification. 

11. The method of claim 10 in which the genetically 
modified mice are transgenic mice which express a growth 
hormone (GH) mutant which. is a GH receptor antagonist which 
antagonizes a growth hormone endogenously expressed by said 
mice . 

12. The method of claim 10 in which the genetically 
modified mice are mice which do not express a functional 
growth hormone receptor binding protein (GHRBP) . 

13. The method of claim 9 in which the second group of 
mice are nontransgenic mice which are not treated with 
streptozotocin . 

14. The method of any one of claims 4-6 in which the 
level of expression of the marker protein is ascertained by 
measuring the level of the corresponding messenger RNA. 
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15. The method of any one of claims 4-6 in which the 
level of expression is ascertained by measuring the level of 
a protein encoded by said marker gene. 

• 16. The method of any one of claims 1-8, 14-15 in 
which the subjects are diabetic and the kidney damage is, at 
•least in part, diabetes-induced. 

17. The method of any one of claims 1-8, 14,15 in which 
the subjects are hyperinsulinemic . 

18. The methpd of claims 4 or 6 in which the first 
group of mice exhibit kidney damage. 

19. The method of claims 4 or 6 in which the first 
group of mice are genetically modified to overproduce growth 
hoimone . 

20. The method- of claim 18 in which the first group of 
mice produce a heterologous growth hormone in addition to 
mouse growth hormone.; 

22. The method of claim 18 in which the second group 

r 

of mice are normal mice. 

23. The method of any one of claims 1-3 in which the 
agent is a DNA, or is encodable by a DNA, which speciieically 
hybridizes to the recited DNA strand of any of SEQ ID NOs: 
1-34, or to the complementary strand thereof. 

24. The method of any one of claims 1, 2, 4 or 5 in 
which said polypeptide is at least 80% identical or at least 
highly conservatively identical to said reference protein. 

25. The method of any one of claims 1, 2, 4 , 5 or 24 
in which said polypeptide iS at least 90% identical to said 
reference protein. 

26. The method of any one of claims 1, 2, 4 or 5 in 
which said polypeptide is identical to said reference 
protein . 

27. The method of claims 4 or 6 in which said homologue 
is identifiable by a BLASTN or BLASTX search conducted, 
using any of SEQ ID NOS: 1-34 as a query sequence, on the 
NCBI Entrez sequence database (s), on or before the filing 
date of the instant application, and the E value calculated 
by BLASTX or BLASTN for the alignment of that homologue, or 
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cDNA encoding that homologue, to the query sequence is less 
than e-10. 

28. The method of claim 27 in which the E value 
calculated by BLASTN or BLASTX would be less than e-15, more 
preferably less than e-20, still more preferably less than 
e-40, even more preferably less than e-60, considerably more 
preferably less than e-80, and most preferably less than e- 
100. 



wo 2004/013289 

^^PCTAJS2003/024053 

1/2 




4 10 

Duration of diabetes (weeks) 



FIGURE lA 



2.0 




Duration of diabetes (weelcs) 



FIGURE IB 



wo 2004/013289 




PCT/US2003/024053 



<D 

I 

I"? 

0) i< 
g O 

O 



16 
14 
12 
10 
8 
6 
4 
2 
0 



■ 




->=>;• 












1 





□ NT 
EibGH 



12 



Age (months) 



FIGURE 2 A 




Age (months) 



FIGURE 2B 




FIGURE 2C 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World InteUectual Property 
Organization 
Internationa] Bureau 

(43) International Publication Date 
12 February 2004 (12.02.2004) 



10 FEB 




(10) International Publication Number 

PCX wo 2004/013289 A2 



(51) International Patent Classification^: 
(21) International Application Number: 



C12N 



PCT/US2003/024053 
(22) IntemaUonal FlUng Date: 4 August 2003 (04.08.2003) 
(25) Filing Language: English 
(2Q PubllcaUon Language: English 



(30) Priority Data: 
60/400.052 



2 August 2002 (02.08.2002) US 



(71) Applicant (for all designated States except US)i OHIO 
UNIVERSITY [USAJS]; Technology Transfer Office, 
Unit 14, 340 West State Street, Athens, OH 45701 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (/br 6/5 ort/y): KOPCHICK, John, 
J. [USAJS]; 4 Orchaid Lane/ Athens, OH 45701 (US). 
COSCHIGANO, Karen, T. [USAJS]; 11703 Channing- 
way Blvd., The Plains, OH 45780 (US). HOLLAND, 



Amy, N. [USAJS]; 2518 Cleveland Road, Wooster, OH 
44691 (US). 

(74) Agents: BROWDY AND NEIMARK, PXX.C. et al.; 
624 Ninth Street N.W., Suite 300, Washington, DC 20001- 

5303 (US). 

(81) Designated States (national): AU, CA, JP, US. 

(84) Designated States (regional): European patent (AT, BE, 
BG, CH. CY, CZ, DE, DK, EE, ES, H, FR, GB. GR, HU, 
IE, rr, LU. MC. NL, PT. RO, SE, SI, SK, TR). 

PubUshed: 

— without international search report and to be republished 
upon receipt of that report 

(48) Date of publication of this comected version: 

12 August 2004 

(15) Information about Correction: 

see PCT Gazette No. 33/2004 of 12 August 2004, Section H 

[Continued on next page] 



= (54) Title: DIAGNOSIS OF KIDNEY DAMAGE AND PROTECTION AGAINST SAME 



2.0 



< 
00 



o 

I _ 1.5 
o> .5 

3. C 

2 '-5 1.0 

03 



0.5 
0.0 



• • • 



I 



4 ^ 10 

DURATION OF DIABETES 
( weeks) 




O (57) Abstract: Various nucleic acids and proteins have been identified by differential hybridization methods as useful as maricers 
O for diagnosing kidney damage. The identified maricer proteins include (I) androgen related protein, SON protein, FUSE binding 
^ Protein 1, claudinlO, heat shock protein, phospho triesterase related protein, ubiquitin protein ligase Nedd-4, and Ac39/physophilin, 
Q and (II) disabled-2 p96, palmitylated serine/threonine kinase, tumor differentially expressed 1 protein, cytochrome oxidase EI, TLH 
39 protein precursor, hydroxysteroid dehydrogenase 4 delta <5>-3 beta, and glutathione peroxidase IE. The proteins of group (I), 
and antagonists of the proteins of group (II), are useful for protecting mammals against kidney damage. 



wo 2004/013289 A2 IDIIIiiPlliiiiililliiiinii 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette, 



> 



wo 2004/013289 



PCTAIS2003/024053 



1 

DIAGNOSIS OF KXDMEY DAMAGE AMD PROTECTION AGAINST SAME 

BACKGROUND OF THE INVENTION 

Field of the Invention 
5 The invention relates to various nucleic acid molecules 

and proteins, and their use in (1) diagnosing kidney damage/ 
or conditions associated with the development of kidney 
damage/ and (2) protecting mammals (including humans) 
against kidney damage. 

10 

Description of the Backg round Art 
Kidney Anatomy and Physiology 

In humans / the kidneys are the main excretory organs, 
eliminating urea, citric acid, creatinine and other waste 

15 metabolites. The kidneys also conserve or excrete water and 
electrolytes as required. Finally, they produce the hormone 
* erythropoietin as well as other proteins. 

The kidneys are paired organs situated on the posterior 
wall of the adrenal cavity. The cortex is the outer layer. 

20 The medulla consists of multiple triangular renal .pyramids 
whose bases are covered by the cortex and whose tips 
(papillae) project into the minor calyces of the renal 
pelvis. Within the renal pelvis, the minor calyces join to 
form a major calyx / and the major calyces join to form the 

25 renal pelvis, the expanded upper end of the ureter. 

The functional unit of the kidney, where urine is 
formed/ is the nephron. Each nephron consists of (1) a knot 
of coiled capillaries called a glomerulus / and (2) a tubule. 
The proximal end of the tubule forms a cup/ the glomerular 

30 capsule/ which surrounds the glomerulus. The glomerular 
capsule and glomerulus together form the renal corpuscle. 
The renal corpuscles are located in the cortical region of 
the kidney. Moving away from the glomerular capsule, the 
nephron has, in order (1) the proximal convoluted tubule, 

35 (2) the loop of Henle, and (3) the distal convoluted tubule. 

The distal convoluted tubules of several nephrons emptj 
into a collecting tubule, and 10-25 collecting tubules emptj 
into each renal pyramid. 
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Blood passes from the aorta to the renal artery, which 
divides into interlobar arteries (in the renal columns of 
the cortex, which separate the pyramids) , then into arcuate 
arteries parallel to the surface of the kidney, then into 
5 interlobular arteries, and then into afferent arterioles, 
each of which supplies a renal corpuscle and forms a 
capillary network called a glomerulus. The blood leaves 
through an efferent arteriole that divides into peritubular 
capillaries surrounding the tubules; these capillaries 

10 converge into interlobular veins, which feed arcuate veins, 
which empty into interlobar veins, and then connect to the 
renal vein, which in turn feeds the inferior vena cava. 
There thus are two capillary beds, the glomerulus and the 
peritubular capillaries. 

15 The glomeruli and the renal tubules (nephrons and 

collecting tubules) participate in glomerular filtration, 
tubular reabsorption, and tubular secretion. 

Glomerulosclerosis 

20 Glomerulosclerosis refers to a scarring of the 

glomeruli. The "scarring" is caused by deposition of 
cellular basement membrane proteins such as collagen IV. 
The scarring disrupts the filtering process of the kidneys, 
allowing protein to leak from the blood into the urine 

25 (proteinurea) . About 15% of people with proteinurea are 
diagnosed, usually by kidney biopsy, as having 
glomerulosclerosis. The scarred glomeruli cannot be 
repaired. 

30 End Stage Renal Disease ' 

Many patients with glomerulosclerosis gradually get 
worse until their kidneys fail completely, a condition knowr 
as end stage renal disease (ESRD) . Patients with ESRD must 
go on dialysis or receive a kidney transplant (a new 
35 kidney) . The progression of kidney failure can be slowed 
using ACE inhibitors. 

Each year, in the United States, nearly 80,000 people 
suffer kidney failure. In 1998, -the causes of kidney 
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failure were diabetes ' (43 . 2%) , high blood pressure (23.0%), 
glomerulonephritis (12.3%), polycystic kidney disease 
(2.9%), or other (18.6%). 

5 Diabetes 

Diabetes mellitus is a pleiotropic disease of great 
complexity. The two major types have been termed type I or 
insulin-dependent diabetes mellitus (IDDM) and type II or 
non-insulin-dependent diabetes mellitus (NIDDM) . Type II 

10 diabetes is the predominant form found in the Western world; 
fewer than 8% of diabetic Americans have the type I disease. 
Development of type II diabetes may be linked to a more 
sedentary lifestyle and obesity. 

Type I diabetics are often characterized by their low 

15 or absent levels of circulating endogenous insulin (Unger 

and Foster, 1998) . Islet cell antibodies causing damage to 
the pancreas are frequently present at diagnosis. Injectior. 
of exogenous insulin is required to prevent ketosis and 
sustain life. Type II diabetics are often characterized by 

20 hyperinsulinemia and an increasing resistance to insulin. 

They are usually not insulin dependent or prone to ketosis 
under normal circumstances. Hyperglycemia can usually be 
controlled by an alteration in diet and amount of exercise, 
but insulin treatment may be required.' 

25 Complications of diabetes (end organ damage) include 

retinopathy, neuropathy, and nephropathy (traditionally 
designated as microvascular complications) as well as 
atherosclerosis (a macrovascular complication) . It has beer 
shown that meticulous blood glucose control can often. slow 

30 down or halt the progression of diabetic complications if 
caught early enough (Unger and Foster, 1998) . However, 
tight metabolic control is extremely difficult to achieve. 

Diabetic Nephropathy 
35 Diabetic nephropathy is the leading cause of end stage 

renal disease. About 20 million people in the United States 
have diabetes, and about 100,000 people have kidney failure 
as a result of diabetes. 
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Initial nephromegaly as well as glomerular hypertrophy and 
microalbuminuria/ once thought to be limited to type I 
diabetics, are now seen equally in type II diabetics (Wirta 
et al., 1996). The pathology of diabetic nephropathy begins 
with thickening of the glomerular basement membrane (GBM) , 
an increase in mesangial matrix, and subintimal hyaline 
thickening of both afferent and efferent arterioles (Unger 
and Foster, 1998) . The capacity td clear macromolecules is 
impaired in diabetes, resulting in acciamulation of albumin 
and larger proteins within the glomerular wall and in the 
mesangiiHa that may lead to stimulation of mesangial matrix 
production. 

The progression of kidney failure occurs in five stages 
(Wardle, 1996) : (I) hyperf iltration (increased blood flow) 
and enlargement of the kidneys (II) glomeruli begin to show 
damage, e.g., by microalbuminuria (20-200 ug/min loss of 
albumin); (III) dipstick-positive proteinurea (>200 ug/min) , 
blood levels of creatinine and urea-nitrogen use, (IV) 
glomerular filtration decreases to less than 75 ml/min, and 
(V) kidney failure: filtration rate less than lOml/mini 

Features of the first stage include an increase in 
kidney volume (renal hypertrophy) and function (increased 
glomerular filtration rate or GFR) . Features of the second 
stage include a thickened GBM. Features of the third stage 
include detection of microalbuminuria. Once this stage is 
reached, there is little chance for reversal and the disease 
progresses on to the fourth stage (glomerulosclerosis and 
macroalbuminuria) and the fifth stage (ESRD) . 

Other Kidney Disorders 

Glomerulonephritis is an inflammation of the glomeruli, 
often due to an allergic reaction to streptococcal toxins. 

Pyelonephritis is a bacterial infection of the kidney 
pelvis and surrounding tissues. 

Kidney stones may be formed in the renal pelvis. 



Growth Hormone 

Growth hormone has many roles, ranging from regulation 
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of protein, fat and carbohydrate metabolism to growth 
promotion. GH is produced in the somatrophic cells of the 
anterior pituitary and exerts its effects either through th< 
GH-induced action of IGF-I, in the case of growth promotion, 
5 or by direct interaction with the GHR on target cells 
including liver, muscle, adipose, and kidney cells. 
Hyposecretion of GH during development leads to dwarfism, 
and hypersecretion before puberty leads to gigantism. In 
adults, hypersecretion of GH results in acromegaly, a 

10 clinical condition characterized by enlarged facial bones, 
hands, feet, fatigue and an increase in weight. Of those 
individuals with acromegaly, 25% develop type 2 diabetes. 
This may be due to insulin resistance caused by the high 
circulating levels of GH leading to high circulating levels 

15 of insulin (Kopchick et al.. Annual Rev. Nutrition 1999. 
19:437-61) . . 

A further mode of GH action may be through the 
transcriptional regulation of a number of genes contributinc 
to the physiological effects of GH. 

20 

Diabetic Glomerulosclerosis in Animal Models 

One hallmark of kidney damage is. the development of 
glomerulosclerosis. Several mouse models develop 
glomerulosclerosis similar to that seen in human diabetics. 

25 Streptozotocin (STZ) induced diabetic mice, as well as the 
spontaneously occurring non-obese diabetic mice, develop 
glomerulosclerosis . 

Interestingly, bovine growth hormone (bGH) expressing 
transgenic mice, which are not diabetic but are 

30 hyperinsulinemic, develop severe glomerulosclerosis. In 
contrast, both bGH antagonist transgene mice and GH 
receptor/binding protein (GHR/BP) gene-disrupted homozygous 
(-/-) mice are protected from kidney damage, even when 
diabetes is induced by STZ treatment. 

35 McGrane, et al., J. Biol. Chem. 263:11443-51 (1988) anc 

Chen, et al . , J. Biol. Chem., 269:15892-7 (1994) describe 
the genetic engineering of mice to express bovine growth 
hormone (bGH) or htaman growth hormone (hGH) , respectively. 
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These mice exhibited an enhanced growth phenotype. They 
also developed kidney lesions similar to those seen in 
diabetic glomerulosclerosis, see Yang, et al.. Lab. Invest., 
68:62-70 (1993). Ogueta, et al., J. Endocrinol., 165: 321-8 
5 (2000) reported that transgenic mice expressing bovine GH 
develop arthritic disorder and self-antibodies. 

Growth hormone genes and the proteins encoded by them ' 
can be converted into growth hormone antagonists by 
mutation, see Kopchick USP 5,350,836. Transgenic mice have 

10 ' been made that express the GH antagonists bGH-G119R or hGH 
G120R, and which exhibit a dwarf phenotype. Chen, et al., 
J. Biol. Chem., 263:15892-7 (1994); Chen, et al., Mol. 
Endocrinol, 5:1845-52 (1991); Chen, et al., Proc. Nat. Acad. 
Sci. USA 87:5061-5 (1990) . These mice did not develop 

15 kidney lesions. See Yang (1993), supra. 

Chen, et al., Endocrinol, 136:660-7 (1995) compared the 
effect of streptozotocin treatment in normal nontransgenic 
mice, and in mice transgenic for (1) a GH receptor 
antagonist, the G119R mutant of bovine growth hormone or (2) 

20 the E117L-mutant of bGH. (According to Chen's ref. 24, 
these large GH transgenic streptozotocin-treated mice 
constitute an animal model for diabetes.) 
Glomerulosclerosis was seen in diabetic (STZ-treated) 
nontransgenic mice and in diabetic bGH-E117L mice, but not 

25 in diabetic bGH-G119R mice. 

Two of the proteins which mediate growth hormone 
activity are the growth hormone receptor and the growth 
hormone binding protein, encoded by the same gene in 
mice (GHR/BP) . It is possible to genetically engineer mice 

30 so that the gene encoding these proteins is disrupted 

(^'knocked-out"; inactivated), see Zhou, et al . , Proc. Nat. 
Acad. Sci. (USA), 94:13215-20 (1997), Zhou, et al . 
inactivated the GHR/BP gene by replacing the 3' portion of 
exon 4 (which encodes a portion of the GH binding domains) 

35 and the 5* region of intron 4 with a neo gene cassette. Th€ 
modified gene was introduced into the target mice by 
homologous recombination. Like mice expressing a GH 
antagonist, homozygous GHR/BP-KO mice exhibit a dwarf 
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phenotype. GHR/BP-KO mice, made diabetic by streptozotocin 
treatment, are protected from the development of diabetes- 
associated nephropathy. Bellush, et al., Endocrinol./ 
141:163-8 (2000). 

Differentlal/Subtractive Hybridization 

Zhang, et al.. Kidney International, 56:549-558 (1999) 
identified genes up-regulated in 5/6 nephrectomized 
(subtotal renal ablation) mouse kidney by a PCR-based 
subtraction method. Ten known and nine novel genes were 
identified. The ultimate goal was to identify genes 
involved in glomerular hyperf iltration and hypertrophy. 

Melia, et al . / Endocrinol. , 139:688-95 (1998) applied 
subtractive hybridization methods for the identification of 
androgen-reguiated genes in mouse kidney. The treatment 
mice were dosed .with dihydrotestosterone, an androgen. 
Kidney androgen-regulated protein gene was used as a 
positive control, as it is known to be up-regulated by DHT. 

Holland, et al.. Abstract 607, ^^Identification of Genes 
Possibly Involved in Nephropathy of Bovine Growth Hormone 
Transgenic Mice'' (Endocrine Society Meeting, June 22, 2000) 
describes the screening strategy of Example 3, but does not 
disclose the identity of any of the differentially expressec 
clones . 

Coschigano, et al.. Abstract 333, ^^Identification of 
Genes Potentially Involved in Kidney Protection During 
Diabetes" (Endocrine Society Meeting, June 22, 2000) 
describes the screening strategy of Example 1, but does not 
disclose the identity of any of the differentially expressec 
clones . 

The following differential hybridization articles may 
also be of interest: 

Wada, et al., ^^Gene expression profile in 
streptozotocin-induced diabetic mice kidneys undergoing 
glomerulosclerosis''. Kidney Int, 59:1363-73 (2001); 

Song, et al., ^^Cloning of a novel gene in the human 
kidney homologous to rat muncl3S: its potential role in 
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diabetic nephropathy". Kidney Int., 53:1689-95 (1998); 

Page, et al., ''^Isolation of diabetes-associated kidney 
genes using differential display", Biochem. Biophys. Res. 
Comm., 232:49-53 (1997). 
5 Peradi, ^^Subtractive hybridization claims: An efficient 

technique to detect overexpressed mRNAs in diabetic 
nephropathy," Kidney Int. 53:926-31 (1998). 

Condorelli, EMBO J., 17:3858-66 (1998). 

See also WOOO/6.6784 (differential hybjridization 
10 screening for brown adipose tissue); PCT/USOO/12366, filed 
May 5, 2000 (differential hybridization screening for 
liver) . 

IdGiitiflcBtlon o£ g&rx&s involved in nephropathy 

15 Attention recently has focused on the generation of 

kidney mRNA .expression profiles and the identification of 
genes involved in nephropathy. Initial studies examined 
expression in the kidney of known genes ^ such as allV 
collagen, laminin Bl, TGF-Bl, 72 kDa collagenase, and TIMP- 

20 3, which are involved in extracellular matrix synthesis and 
degradation (Esposito et al., 1996*; Striker et al., 1996; 
Yang et al., 1997; Yang et al., 1994). Changes in 
facilitative glucose transporter, receptors for advanced 
glycation end products (RAGE) , and TGFB isoform and receptoi 

25 messages have also been examined (Chin et al., 1997; Hill et 
al., 2000; Ziyadeh et al . , 1997). With recent advances in 
gene profiling techniques, several databases docximenting 
mRNA expression in the kidney as a whole as well as in 
individual cell types are being generated (Takenaka et al., 

30 1998; Virion et al., 1999; Wada et al . , 2001). These 
techniques, as well as others, including differential 
display and subtractive hybridization, are being used to 
identify additional genes involved in the development and 
progression of diabetic nephropathy (Condorelli et al . , 

35 1998; Page et al . , 1997; Peraldi et al., 1998; Song et al., 
1998; Wada et al., 2001). 
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SUMMARY OF THE INVSNTZON 

Differential hybridization techniijues have been used tc 
identify mouse genes that are differentially expressed in 
mice, depending upon the extent or susceptibility to kidney 
5 damage. By identifying related human genes and proteins, 

one may identify agents useful in protecting humans against 
kidney damage. Protection against kidney damage mediated at 
least in part by growth hormone, and/or associated with 
diabetes, is of particular interest. 

10 Based on the differential expression of clones H8, HI, 

Fl, F2, F4,F5, F6, F27, F39, G38, G16, F16, B45, B3 and 
B4 6, we believe that the human proteins most closely 
corresponding to mouse vacuolar adenosine triphosphate, 
ubiquitin protein ligase Nedd-4, SON protein, FUSE binding 

15 protein 1, kidney androgen related protein, claudin 10, and 
heat shock protein 105 kDa (Hsp 105), a phosphotriesterase, 
certain immunoglobulins, and certain other proteins 
identified belbw, have a protective effect. 

Similarly, based on the differential expression of 

20 clones G26, C22, A8, A34, A39, A48, E39, F21, F38, F40, G9, 
G24 and G28, we believe that it would be desirable to 
inhibit the expression of the human proteins most closely 
corresponding to mouse 3-beta-hydroxysteroid dehydrogenase 
type 4, disable-2 p96 {Dab2), palmitylated serine/threonine 

25 kinase, tumor dif f erentiallly expressed 1 (TDEl), cytochrome 
oxidase III, ERG2 protein and/or glutathione peroxidase 
3, certain immunoglobulins, and certain other proteins 
identified below, or to neutralize their activity, with a 
suitable antagonist (e.g., a binding antibody or 

30 . oligopeptide) . 

Thus, Applicants contemplate: 

(1) use of the ^^f avorable'^ mouse DNAs of the Master 
Table (below) to isolate or identify related human DNAs; 

(2) use of human DNAs, related to favorable mouse DNAs, 
35 to express the corresponding human proteins; 

(3) use of the corresponding human proteins (and mouse 
proteins, if the sequence is sufficiently complete to be 
biologically active, and is active in hiimans) , to protect 
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against kidney damage; and 

(4) use of the corresponding mouse or human proteins / 
in diagnostic agents, in assays to measure kidney damage or 
protection against such damage. 
5 Moreover Applicants contemplate: 

(1) use of the ''unfavorable'' mouse DNAs of the Master 
Table to isolate or identify related human DNAs; 

(2) use- of the complement to the ''unfavorable" mouse 
DNAs or related human DNAs, as antisense molecules to 

10 inhibit expression of the related human DNAs; 

(3) use of the mouse or human DNAs to express the 
corresponding mouse or human proteins; 

(4) use of the corresponding mouse or human proteins, 
in diagnostic agents^ in assays to measure kidney damage; 

15 (5) use of the corresponding mouse or htiman proteins in 

assays to determine whether a substance binds to (and hence 
may neutralize) the protein; and 

(6) use of the neutralizing substance to protect 
against kidney damage . 

20 The related human DNAs may be identified by comparing 

the mouse sequence (or its AA translation product) to known 
human DNAs (and their AA translation products) . If this is 
unsuccessful/ human cDNA or genomic DNA libraries may be 
screened using the mouse DNA as a probe. 

25 Agents which bind the "favorable" and "unfavorable" 

nucleic acids, or the corresponding proteins (e.g., an 
antibody vs. the protein) may be used to screen for kidney 
damage . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Changes In glomerular volume (left panel) and 
urinary albumin excretion (right panel) as a function of 
5 duration of STZ-lnduced diabetes In wild-type mice. 

Glomerular measurements were made .from sections of kidney 
fixed in 4% paraformaldehyde' (Fig. lA) . Urinary albumin 
excretion was determined by radioimmune assay (RIA) using a 
. rat albumin antibody. Urinary volume output was normalized 
10 by creatinine concentration comparisons using a standard 

colorimetric assay (Fig. IB) . ND: nondiabetic; DB, diabetic. 

Figure 2. Physiological assessment of the progression of 
nephropathy in the kidney of NT and bGH female mice at 2, 5, 
and 12 months of age. Glomerular (Fig. 2A) and mesangial 
(Fig. 2BJ measurements were made from sections of kidney 
fixed in 4% paraformaldehyde. Urinary albumin excretion 
(Fig. 20) was determined by radioimmune assay (RIA) using a 
rat albumin antibody. Urinary volume output was normalized 
by creatinine concentration comparisons using a standard 
colorimetric assay. 
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DETAILED DESCRXPTIOK OF THE PREFERRED EMBODIMENTS OF TBE 
INVENTION 

Identified Differentially E^^ressed cDNAs 

In three differential expression studies (examples 1- 
3)/ we have obtained cDNAs which were differentially 
expressed • 

In Example 1, we looked ^t differential expression of 
genes in kidneys of streptozotocin-treated mice (a model of 
diabetes) as a result of GHR/BP knockout. The clones of 
interest included: G26, 33,, iii. Fl, G38 , G9, G16 , G24, G28 
(favorables underlined) . 

In Example 2, we obtained data on differential 
expression of genes in kidneys of nontransgenic mice as a 
result of streptozotocin treatment. .The clones of interest 
included: Fl, £4, F5, F6, F27, E39, F2, £16, F21, F38, F39 , 
and F40. 

In Example 3, we studied differential expression of 
genes in kidneys of mice as a result of overproduction of 
growth hormone. The clones of interest included: A8, A3 4, 
A39, A48, B45 , C22, £3/ B46. 

Direct and Indirect Utility of Identified Nucleic Add 
Sequences and Related Molecules 

The cDNAs of the disclosed clones may be used directly. 
For diagnostic or screening purposes, they (or specific . 
binding fragments thereof) may be labeled and used as 
hybridization probes. For therapeutic purposes, they (or 
specific binding fragments thereof) may be used as antisense 
reagents to inhibit the expression of the corresponding 
gene, or of a sufficiently homologous gene of another 
species . 

If the cDNA appears to be a full-length cDNA, that is, 
that it encodes an entire, functional protein,^ then it may 
be used in the expression of that protein. Such expression 
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may be in cell culture ^ with the protein subsequently 
isolated and administered exogenously to subjects who would 
benefit therefrom, or in vivo, i.e., administration by gene 
therapy. Naturally, any DNA encoding the same protein, or* a 
fragment or a mutant protein which retains the desired 
activity, may be used for the same purpose. The encoded 
protein of course has utility therapeutically and, in 

.labeled or immobilized form, diagnostically . 

The cDNAs of the disclosed clones may also be used 
indirectly, that is, to identify other useful DNAs, 

'prdteins, or other molecules. We have attempted to 
determine whether the cDNAs disclosed herein have 
significant similarity to any known DNA, and whether, in an^ 
of the six possible combinations of reference frame and 
strand, they encode a protein similar to a known protein. 
If so, then it follows that the known protein, and DNAs 
encoding that protein, may be used in a similar manner. In 
addition, if the known protein is known to have additional 
homologues, then those homologous proteins, and DNAs 
encoding them, may be used in a similar manner. 

There thus are several ways that a human protein 
homologue of interest can be identified by database 
searching , including : 

1) a DNA->DNA (BlastN) search for database DNAs closely 
related to the mouse cDNA clone identifies a particular 
mouse (or other nonhuman, e.g., rat) gene, and that nonhviman 
gene encodes a protein for which there is a known humane- 
protein homologue; 

2) a DNA->Protein (BlastX) search for database proteins 
closely related to the translated DNA of the mouse cDNA 
clone identifies a particular mouse (or other nonhuman) 
protein, and that nonhuman protein has a known human protein 
homologue; 
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3) a DNA-->DNA (BlastN) search of the database for human DNAs 
closely related to the mouse cDNA clone identifies a 
particular human DNA as a homologue of the mouse cDNA, and 
the corresponding human protein is known (e.g./ by 

5 translation of the human DNA) ; and 

4) a DNA->Protein (BlastX) search of the database for human 
proteins closely related to the translated DNA of the mouse 
cDNA clone identifies a particular human protein as a 

10 homologue of the corresponding mouse protein. 

Thus/ if we have identified a mouse cDNA, and it 
encodes a mouse protein which appears similar to a human 
protein, then that human protein may be used (especially in 

15 humans) for purposes analogous to the proposed use of the 
mouse protein in mice. Moreover, a specific binding 
fragment of an appropriate strand of the corresponding human 
gene or cDNA could be labeled and used as a hybridization 
probe (especially against samples of human mRNA or cDNA) . 

20 In determining whether the disclosed cDNAs have 

significant similarities to known DNAs (and their translated 
AA sequences to known proteins) , one would generally use the 
disclosed cDNA as a query sequence in a search of a sequence 
database. The results of several such searches are set 

25 forth in the Examples. Such results are dependent, to some 
degree, on the search parameters. Preferred parameters are 
set forth in Example 1. The results are also dependent on 
the content of the database. While the raw similarity score 
of a particular target (database) sequence will not vary 

30 with content (as long as it remains in the database) , its 

informational value (in bits), expected value, and relative 
ranking can change. Generally speaking, the changes are 
small • 

35 It is possible to use the sequence of the entire cDNA 

insert to query the database. However, the error rate 
increases as a sequencing run progresses. Hence, it may be 
beneficial to search the database using a truncated 
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(presumably more accurate) sequence, especially if the 
insert is quite long. 

It will be appreciated that the nucleic acid and 
protein databases keep growing. Hence a later search, may 
5 identify high scoring target sequences which were not 

uncovered by an earlier search because the target sequences 
were. not previously part of a database. 

Hence, in a preferred embodiment, the cognate DNAs and 
proteins include not only those set forth in the examples, 
10 but those which would have been highly ranked (top ten, more 
preferably top three, even more preferably top two, most 
preferably the top one) in a search run with the same 
parameters on the date of filing of this application. 

15 If the cDNA appears to be a partial cDNA, it may be 

used as a hybridization probe to isolate the full-length 
cDNA. If the partial cDNA encodes a biologically functional 
fragment of the cognate protein, it may be used in a manner 
similar to the full length cDNA, i.e., to produce the 

20 * functional fragment. 

If we have indicated . that an antagonist of a protein or 
other molecule is useful, then such an antagonist may be 
obtained by preparing a combinatorial library, as described 

25 below, of potential antagonists, and screening the library 
members for binding to the protein or other molecule in 
question. The binding members may then be further screened 
for the ability to antagonize the biological activity of the 
target. The antagonists may be used therapeutically, or, in 

30 suitably labeled or immobilized form, diagnostically. 

If the cDNA is related to a known protein, then 
substances known to interact with that protein (e.g., 
agonists, antagonists, substrates, receptors, second 
messengers, regulators, and so forth) , and binding molecules 

35 which bind them, are also of utility. Such binding 
molecules can likewise be identified by screening a 
combinatorial library . 
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Xsola'tlon of Full Length oDMA^ Using Partial cDKAs as probes 

If it is determined that a cDNA of the present 
invention is a partial cDNA, and the cognate full length 
cDNA is not listed in a sequence database r the available 
5 cDNA may be used as a hybridization probe to isolate the 
fullr-length cDNA from a suitable cDNA library. 

Stringent hybridization conditions are appropriate^ 
that is, conditions in which the hybridization temperature 
is 5-10 deg. C. below the Tm of the cDNA as a perfect 
:10. duplex. 

Identification and Isolation of Homologous Genes/cDNAs Using 
a cDNA Probe 

It may be that the sequence databases available do not 

15 include the sequence of any homologous gene, or at least of 
.the homologous gene for a species of interest. However^ 
given the cDNAs set forth above, one may readily obtain the 
homologous gene - 

The possession of one cDNA (the "starting DNA") 

20 greatly facilitates the isolation of homologous genes/cDNAs. 
If the clone in question only features a partial cDNA, this 
partial cDNA may first be used as a probe to isolate the 
.corresponding full length cDNA for the same species, and 
that the latter may be used as the starting DNA in the 

25 search for homologous genes. 

The starting DNA, or a fragment thereof, is used as a 
hybridization probe to screen a cDNA or genomic DNA library 
for clones containing inserts which encode either the entire 
homologous protein, or a recognizable fragment thereof. The 

30 minimum length of the hybridization probe is dictated by the 
need for specificity. If the size of the library in bases 
is L, and the GC content is 50%, then the probe should have 
a length of at least 1, where L = 4^. This will yield, on 
average, a single perfect match in random DNA of L bases. 

35 The human cDNA library is about 10® bases and the human 
genomic DNA library is about 10" bases. 

The library is preferably derived from an organism 
which is known, on biochemical evidence, to produce a 
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homologous protein, and more preferably from the genomic DNf 
or mRNA of cells of that organism which are likely to be 
relatively high producers of that protein. A cDNA library 
(which is derived from an mRNA library) is especially 
preferred. 

If the organism in question is known to have 
substantially different codon preferences from that of the 
organism whose relevant cDNA or genomic DNA is known, a 
synthetic hybridization probe may.be used which encodes the 
same amino acid sequence but whose codon utilization is mor€ 
similar to that of the DNA of the target organism. 
Alternatively, the synthetic probe may employ inosine as a 
substitute for those bases which are most likely to be 
divergent, or the probe, may be a mixed probe which mixes the 
codons for the source DNA with the preferred codons 
(encoding the same amino acid) for the target organism. 

• By routine methods, the Tm of a perfect duplex of 
starting DNA is determined. One may then select a 
hybridization temperature which is sufficiently lower than 
the perfect duplex Tm to allow hybridization of the startinc 
DNA (or other probe) to a target DNA which is divergent fron 
the starting DNA. A 1% sequence divergence typically lowers 
the Tm of a duplex by 1-2 °C, and the DNAs encoding . 
homologous proteins of different species typically have 
sequence identities of around 50-80%. Preferably, the 
library is screened under conditions where the temperature 
is at least 20*'C., more preferably at least SO^'C, below the 
perfect duplex Tm. Since salt reduces the Tm, one 
ordinarily would carry out the search for DNAs encoding 
highly homologous proteins under relatively low salt 
hybridization conditions, e.g., <1M NaCl . The higher the 
salt concentration, and/or the lower the temperature, the 
greater the sequence divergence which is tolerated. 

For the use of probes to identify homologous genes in 
other species, see, e.g., Schwinn, et al . , J. Biol. Chem. , 
265:8183-89 (1990) (hamster 67-bp cDNA probe vs. human 
leukocyte genomic library; human 0.32kb DNA probe vs. bovine 
brain cDNA library, both with hybridization at 42 in 
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6xSSC); Jenkins et al., J. Biol. Chem., 265:19624-31 (1990) 
(Chicken 770-bp cDNA probe vs. human genomic libraries; 
hybridization at 40'*C in 50% formamide and 5xSSC) ; Murata et 
al., J. Exp. Med., 175:341-51 (1992) (1.2-kb mouse cDNA 
probe V. human eosinophl cDNA library; hybridization at 65*^C 
in 6xSSC) ; Guyer et al., J. Biol. Chem., 265:17307-17. (1990) 
(2.95-kb human genomic DNA probe vs. porcine genomic DNA 
library; hybridization at 42**C in 5xSSC) - The conditions 
set forth in these articles may each be considered suitable 
for the purpose of isolating homologous genes. 

Homologous Proteins axid DNAs 

A human protein can be said to be identifiable as 
homologous to a mouse cDNA clone if 

(1) its sequence can be aligned to the mouse cDNA clone, 
using BlastX with the default parameters set forth below, 
and the expected value (E) of the alignment (the probability 
that such an alignment would have occurred by chance alone) 
is less than e-10, or 

(2) it is encoded by a hiiman gene whose cDNA can be aligned 
to the mouse cDNA clone, using BlastN with the default 
parameters set forth below, and the E value of the alignment 
is less than e-10. 

Desirably, both conditions are satisfied. 

Preferably, for either or both conditions, the E value 
is less than e-15, more preferably less than e-20, still 
more preferably less than e-40, even more preferably less 
than e-60, considerably more preferably less than e-80, and 
most preferably less than e-100. 



BlastN and BlastX report very low expected values as 
^^0.0". This does not truly mean that the expected value is 
exactly zero (since any alignment could occur by chance) , 
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but merely that it is so infinitesimal that it is not 
reported. The documentation does not state the cutoff 
value, but the data below includes alignments with explicit 
E values as low as e-178 (624 bits), while a score of 636 
bits was reported as ^^0.0". 

Functionally homologous human proteins are also. of 
interest* A human protein may be* said to be functionally 
homologous to the mouse cDNA clone if (1) there is a mouse 
protein which is encoded by a mouse gene whose cDNA can be 
aligned to the mouse cDNA clone, using BlastX with the 
default parameters set forth below, and the E value of the 
alignment is less than e-50, and (2) the human protein has 
at least one biological activity in common with the mouse 
protein. 

The human proteins of interest also include those that 
are substantially and/or conservatively identical (as 
defined below) to the homologous and/or functionally 
homologous htiman proteins defined above. 



Relevance of Favorable and Unfavorable Genes 

If a gene is down-regulated in protected mammals, or 
up-regulated in damaged mammals, (i.e., an ^^unfavorable 
gene") then several utilities are apparent. 

First, the complementary strand of the gene, or a 
portion thereof, may be used in labeled form as a 
hybridization probe to detect messenger RNA and thereby 
monitor the level of expression of the gene in a subject. 
Elevated levels are indicative of damage, or possibly of a 
propensity to damage, and clinicians may take appropriate 
preventative, curative or ameliorative action. 

Secondly, the messenger RNA product (or equivalent 
cDNA) , the protein product, or a binding molecule specific 
for that product (e.g., an antibody which binds the 
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product) / or a downstream product which mediates the 
activity (e.g., a signaling intemediate) or a binding 
molecule (e.g., an antibody) therefor, may be used, 
preferably in labeled or immobilized form, as an assay 
5 reagent in an assay for said nucleic acid product, protein 
product, or downstream product (e.g., a signaling 
intermediate) . Again, elevated levels are indicative of a 
present or future problem. 

Thirdly,, an agent which down-regulates expression of 

10 the gene may be used to reduce levels of the corresponding 
protein and thereby inhibit further damage to the kidney. 
This agent could inhibit transcription of the gene in the 
subject, or translation of the corresponding messenger RNA. 
Possible inhibitors of transcription and translation include 

15 antisense molecules and repressor molecules. The agent 

could also inhibit a post-translatipnal modification (e.g., 
glycosylation, phosphorylation, cleavage, GPI attachment) 
required for activity, or post-translationally modify the 
protein so as to inactivate it. Or it could be an agent 

20 which down- or up-regulated a positive or negative 
regulatory gene, respectively. 

Fourthly, an agent which is an antagonist of the 
messenger RNA product or protein product of the gene, or of 
a downstream product through which its activity is 

25 manifested (e.g., a signaling intermediate), may be used to 
inhibit its activity. This antagonist could be an antibody. 

Fifthly, an agent which degrades, or abets the 
degradation of, that messenger RNA, its protein product or a 
downstream product which mediates its activity (e.g., a. 

30 signaling intermediate), may be used to curb the effective 
period of activity of the protein. 

If a gene is up -reaulated in protected mammals, or 
down -regulated in damaged animals then the utilities are 
converse to those stated above. 

35 First, the complementary strand of the gene, or a 

portion thereof, may be used in labeled form as a 
hybridization probe to detect messenger RNA and thereby 
monitor the level of expression of the gene in a subject. 
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Depressed levels are indicative of damage/ or possibly of a 
propensity to damage, and clinicians may take appropriate 
preventative, curative or ameliorative action. 

Secondly, the messenger .RNA product, the equivalent 
5 cDNA, protein product, or a binding molecule specific for 
those products, or a downstream product, or a signaling, 
intermediate, or a binding molecule therefor, may be used, 
preferably in labeled or immobilized form, as an assay 
reagent in an assay for said protein product or downstream 

10 product. Again, depressed levels are indicative of a * * 
present or future problem. 

Thirdly, an agent which up-regulates expression of the 
gene may be used to increase levels of the corresponding 
protein and thereby inhibit further damage to the kidney* 

15 By way of example, it could be a vector which carries a copi 
of the gene, but which expresses the gene at higher levels 
than does the endogenous expression system. Or it could be 
an agent which up- or down-regulates a positive or negative 
regulatory gene. 

20 Fourthly, an agent which is an agonist of the protein * 

product of the gene, or of a downstream product through 
which its activity (of inhibition of kidney damage) is 
manifested, or of a signaling intermediate may be used to 
foster its activity - 

25 Fifthly, an agent which inhibits the degradation of 

that protein product or of a downstream product or of a 
signaling intermediate may be used to increase the effective 
period of activity of the protein. 

30 Relevance of Antibody-Related Genes 

Some of the disclosed clones (e.g., A8, A34, A39, A48, 
E39, G38) appear to be related to antibody genes. It is 
widely accepted that diabetes is a microvasculature problem. 
Elevated levels of glucose or lack of insulin or insulin 
35 resistance result in pathological problems with capillaries. 
The lack of proper blood supply to a variety of tissues 
results in tissue damage including neuronal problems 
(neurology) , kidney problems (nephropathy) , cardiovascular 
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problems, and eye problems (proliferative diabetic 
retinopathy) . Severe blood vessel problems (macrovascular 
disorders) can lead to necrotic tissue that often results in 
amputations • 

Also, there is considerable evidence that the 
underlying etiology of Type I diabetes is a problem with the 
immune system, in that an autoimmune reaction is mounted 
that results in destruction of the Beta (insulin producing) 
cells of the pancreas. 

Immune reactions have been shown to be involved in 
glomerulosclerosis, see Chen, et al.. Nephron, 78: 63-72 
(1998) and Linder, et al., Clin. Iinjnaunol . Immunopathol . , 
1:104-21 (1972), and to occur in transgenic mice expressing 
bovine GH and exhibiting arthritis, see Ogueta, supra. 

Thus, it is entirely possible .that in a type 11 
diabetic (and in other tissues of Type I diabetics) , 
autoimmune reactions take place. The scenario could be as 
follows: due to inappropriate blood flow to a given tissue 
' (for example, the kidney), pathological changes occur. 
These changes could be in the form of glomerulsclerosis,- 
necrosis, or a variety of other problems. Proteins found in 
the tissue may become altered during these pathological 
changes (for example inappropriate glycosylation, 
phosphorylation, protein cleavage) and this ultimately 
results in the exposure of ^^new'' or foreign epitopes to the 
host. Thus, an autoimmune reaction could be mounted against 
these proteins that impact the destruction of the tissue. 

It follows that inhibition of these autoimmune 
antibodies can be beneficial . 

Conceivably, as titers of these autoimmune antibodies 
rise, a new immune response, featuring anti-idiotypic 
antibodies, will occur; such antibodies would be 
^^f avorable'' • 
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Genes /l&ro'belns of Xxi'beres'b 

Growth hormone (GH) is thought to play a role in the 
progression of kidney damage in mice. Bovine (b) GH 
transgenic mice possess elevated levels of insulin-like 
growth factor -1 (IGF-1) / elevated levels of insulin, a 
giant phenotype, and develop severe kidney damage resulting 
in death within 7-12 months. GH antagonist mice have 
decreased levels of IGF-1, a dwarf phenotype, and fail to 
develop kidney damage, even when made diabetic by treatment 
with streptozotocin (Stz). Therefore, the bGH mice that 
develop kidney damage independent of diabetes but with 
characteristics similar to Stz-induced diabetic nephropathy 
may provide a model system for determining GH' s role in the 
progression of kidney damage. As a consequence of 
determining the genes activated or inactivated in 
correlation with* the morphology of the kidney, as it 
progresses towards glomerulosclerosis, agents may be 
identified which can protect kidneys from glomerulosclerosis 
and other kidney damage. 

Favorable genes/proteins include the following: 
- SON protein 

-vacuolar adenosine triphosphatase 

- Ac3 9 /phy s ophi 1 in 

-ubiquitin protein ligase Nedd-4 

-kindney androgen regulated protein 

-claudins 

-heat shock proteins 

-far upstream element (FUSE) binding protein 

Unfavorable genes /proteins include the following: 
-3 beta-hydroxysteroid dehydrogenase, 
especially of type IV or V. 
-disabled-2 (dab-2) p96 
-palmitylated serine/threonine kinase 
-tumor differentially expressed (TDE) 1 
-cytochrome oxidase III 
-ERG2 protein 
-TLH29 protein precursor 
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-glutathoione peroxidase 3 

Mu'ban'b Pro1;elns 

5 The present invention also contemplates mutant proteins 

(peptides) which are substantially identical (as defined 
below) to the parental protein (peptide). In general, the 
fewer the mutations, the more likely the mutant protein is 
to .retain the activity of the parental protein. The effect ' 
10 of mutations is usually (but not always) additive. Certain . 
individual mutations are more likely to be tolerated than 
others . 

A protein is more likely to tolerate a mutation which 

(a) is a substitution rather than an insertion or 
15 deletion; 

(b) is an insertion or deletion at the terminus, 
rather than internally, or, if internal, is at a 
domain boundary, or a loop or turn, rather than in 
an alpha helix or beta strand; 

20 (c) affects a surface residue rather than an 

interior residue; 

(d) affects a part of the molecule distal to the 
binding site;. 

(e) is a substitution of one amino acid for 
25 another of similar size, charge, and/or 

hydrophobicity, and does not destroy a disulfide 
bond or other crosslink; and 

(f) is at a site which is subject to substantial 
variation among a family of homologous proteins to 

30 which the protein of interest belongs. 

These considerations can be used to design functional 
mutants . 

Surface vs. Interior Residues 
35 Charged residues almost always lie on the surface of 

the protein. For uncharged residues, there is less 
certainty, but in general, hydrophilic residues are 
partitioned to the surface and hydrophobic residues to the 
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interior. Of course, for a membrane protein, the membrane- 
spanning segments are likely to be rich in hydrophobic 
residues. 

Surface residues may be identified experimentally by 
5 various labeling techniques, or by 3-D structure mapping 

techniques like X-ray diffraction and NMR. A 3-D model- of a 
homologous protein can be helpful. 

Binding Site Residues 

10 Residues forming the binding site may be identified by 

(1) comparing the effects of labeling the surface residues 
before and after complexing the protein to its target, (2) 
labeling the binding site directly, with affinity ligands, 
(3) fragmenting the protein and testing the fragments for 

15 binding activity, and (4) systematic mutagenesis (e.g., 
alanine-scanning mutagenesis) to determine which mutants 
destroy binding. If the binding site of a homologous 
protein is known, the binding site may be postulated by 
analogy. 

20 Protein libraries" may be constructed and screened that 

a large family (e.g., 10^) of related mutants may be 
evaluated simultaneously. 

Hence, the mutations are preferably conservative 
modifications as defined below. 

25 

"Substantially Identical" 

A mutant protein (peptide) is substantially identical 
to a reference protein (peptide) if (a) it has at least 10% 
of a specific binding activity or a non-nutritional 

30 biological activity of the reference protein, and (b) is at 
least 50% identical in amino acid sequence to the reference 
protein (peptide) . It is "'substantially structurally 
identical'' if condition (b) applies, regardless of (a) . 
Percentage amino acid identity is determined by 

35 aligning the mutant and reference sequences according to a 

rigorous dynamic prograimning algorithm which globally aligns 
their sequences to maximize their similarity, the similarity 
being scored as the sum of scores for each aligned pair 
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according to an unbiased PAM250 laatrix/ and a penalty for 
each internal gap of -12 for the first null of the gap and - 
4 for each additional null of the same gap. The percentage 
identity is the number of matches expressed as a percentage 
5 of the adjusted (i.e./ counting inserted nulls) length of 
the reference sequence. 

A* mutant DNA sequence is substantially identical to a 
reference DNA sequence if they are structural sequences, and 
encoding mutant and reference proteins which are 

10 . substantially identical as described above. 

If instead they are regulatory sequences, they are 
substantially identical if the mutant sequence has at least 
10% of the regulatory activity of the reference sequence, 
and is at least 50% identical in nucleotide sequence to the 

15 reference sequence. Percentage identity is determined as 

for proteins except that matches are scored +5, mismatches - 
4, the gap open penalty is -12, and the gap extension 
penalty (per additional null) is -4. 

Preferably, sequence which are substantially identical 

20 exceed the minimum identity of 50% e.g., are 51%, 66%, 75%, 
80%, 85%, 90%, 95% or 99% identical in sequence. 

DNA sequences may also be considered "substantially 
identical" if they hybridize to each other under stringent 
conditions, i.e., conditions at which the Tm of the 

25 heteroduplex of the one strand of the mutant DNA and the 
more complementary strand of the reference DNA is not in 
excess of 10 ^'C. less than the .Tm of the reference DNA 
homoduplex. Typically this will correspond to a percentage 
identity of 85-90%. 

30 • 

"Conservative Modifications" 

"Conservative modifications" are defined as 
(a) conservative substitutions of amino acids as 
hereafter defined; or 
35 (b) single or multiple insertions (extension) or 

deletions (truncation) of amino acids at the 
termini . 

Conservative modifications are preferred to other 
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modifications. Conservative substitutions are preferred to 
other conservative modifications . 

"Semi-Conservative Modifications" are modifications 
which are not conservative, but which are (a) semi- 
5 conservative substitutions as hereafter defined; or (b) 

single or multiple insertions or deletions internally, but 
at interdomain boundaries, in loops or in other segments of 
relatively high mobility • Semi-conservative modifications 
are preferred to nonconservative modifications, Semi- 

10 conservative substitutions are preferred to other seial- 
conservative modifications . 

Non-conservative substitutions are preferred to other 
non-conservative modifications . 

The term "conservative" is used here in an a priori 

15 sense, i.e., modifications which would be expected to 

preserve 3D structure and activity, based on analysis of the 
naturally occurring families of homologous proteins and of 
past experience with the effects of deliberate mutagenesis, 
rather than post facto ^ a modification already known to 

20 conserve activity. Of course, a modification which is 

conservative a priori may, and usually is, also conservative 
post facto .' 

Preferably, except at the termini, no more than about 
five amino acids are inserted or deleted at a particular 
25 locus, and the modifications are outside regions known to 
contain binding sites important to activity. 

Preferably, insertions or deletions are limited to the 
termini . 

A conservative substitution is a substitution of one 
30 amino acid for another of the same exchange group, the 
exchange groups being defined as follows 

I Gly, Pro, Ser, Ala (Cys) (and any nonbiogenic, 
neutral amino acid with a hydrophobicity not 
exceeding that of the aforementioned a.a.'s) 
35 II Arg, Lys, His (and any nonbiogenic, positively- 

charged amino acids) 
III Asp, Glu, Asn, Gin (and any nonbiogenic 
negatively-charged amino acids) 
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IV Leu/ lie. Met, Val (Cys) (and any nonbiogenic, 
aliphatic, neutral amino acid with a 
hydrophobicity too high for I above) 

V Phe, Trp, Tyr (and any nonbiogenic, aromatic 

5 neutral amino acid with a hydrophobicity too high 

for I above) . 
Note that Cys belongs to both I and IV. 
Residues Pro, Gly and Cys have special conformational 
ro.les. Cys participates in formation of disulfide bonds. 
10 -Gly imparts flexibility to the chain. Pro imparts rigidity 
to the chain and disrupts a helices. These residues may be 
. essential in certain regions of the polypeptide, but 
substitutable elsewhere. 

One, two or three conservative substitutions are more 
15 likely to be tolerated than a larger number. 

"Semi-conservative substitutions" are defined herein as 
being substitutions within supergroup I/II/III or within 
supergroup IV/V, but not within a single one of groups I-V. 
They also include replacement of any other amino acid with 
20 alanine. If a substitution is not conservative, it 
preferably is semi-conservative. 

'.'Non-conservative substitutions" are substitutions 
which are not ^^conservative'' or "serai-conservative". 

"Highly conservative substitutions" are* a subset of 
25 conservative substitutions, and are exchanges of amino acids 
within the groups Phe/Tyr/Trp, Met/Leu/Ile/Val, His/Arg/Lys, 
• Asp/Glu and Ser/Thr/Ala. They are more likely to be 
tolerated than other conservative substitutions. Again, the 
smaller the number of substitutions, the more likely they 
30 are to be tolerated. 

''Conservatively Identical'' 

A protein (peptide) is conservatively identical to a 
reference protein (peptide) it differs from the latter, if 
35 at all, solely by conservative modifications, the protein 
(peptide remaining at least seven amino acids long if the 
reference protein (peptide) was at least seven amino acids 
long. 



I 
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A protein is at least semi-conservatively identical to 
a reference protein (peptide) if it differs from the latter^ 
if at all, solely by semi-conservative or conservative 
modifications . 

5 . A protein (peptide) is nearly conservatively identical 

to a reference protein (peptide) if it differs from thfe 
latter, if at all, solely by one or more conservative 
.modifications and/or a single nonconservative substitution. 

It is highly conservatively identical if it differs, if 
10 at all, solely by highly conservative substitutions. Highly 
conservatively identical proteins are preferred to those 
merely conservatively identical. An absolutely identical 
protein is even more preferred. 

15 

The core sequence of a reference protein (peptide) is- 
the largest single fragment which retains at least 10% of a 
particular specific binding activity, if one is specified, 
or otherwise of at least one specific binding activity of 

20 the referent. If the referent has more than one specific 
binding activity, it may have more than one core sequence, 
and these may overlap or not. 

If it is taught that a peptide of the present invention 
may have a particular similarity relationship (e.g., 

25 markedly identical) to a reference protein (peptide), 
preferred peptides are those which comprise a sequence 
having that relationship to a core sequence of the reference 
protein (peptide), but with internal insertions or deletions 
in either sequence excluded. Even more preferred peptides 

30 are those whose entire sequence has that relationship, with 
the same exclusion, to a core sequence of that reference 
protein (peptide) . 



35 Iiibrairy 

The term "library" generally refers to a collection of 
chemical or biological entities which are related in origin, 
structure, and/or function, and which can be screened 
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simultaneously for a property of interest. 

Libraries may be classified by how they are constructed 
(natural vs. artificial diversity; combinatorial vs. 
noncombinatorial) , how they are screened (hybridization, 
5 expression, display) , or by the nature of the screened 
library members (peptides, nucleic acids, etc.)* 

In a ^^natural diversity'' library, essentially all of 
the diversity arose without .human intervention. This would 
be true, for example, of messenger RNA extracted from a non- 
10 engineered cell. 

In a synthetic diversity" library, essentially all of 
the diversity arose deliberately as a result of human 
intervention.* This would be true for example of a 
combinatorial library; note that a small level of natural 
15 diversity could still arise as a result of spontaneous 
mutation. It would also be true of a noncombinatorial 
library of compounds collected from diverse sources, even if 
they were all natural products. 

In a ^^non-natural diversity" library, at least some of 
20 the diversity arose deliberately through human intervention. 

In a ^^controlled origin" library, the source of the 
diversity is limited in some way. A limitation might be to 
cells of a particular individual, to a particular species, 
or to a particular genus, or, more complexly, to individuals 
25 of a particular species who are of a particular age, sex, 

physical condition, geographical location, occupation and/or 
familial relationship. Alternatively or additionally, it 
might be to cells of a particular tissue or organ. .Or it 
could be cells exposed to particular pharmacological, * *»• 
30 environmental, or pathogenic conditions. Or the library 

could be of chemicals, or a particular class of chemicals, 
produced by such cells. 

In a ^'controlled structure" library, the library 
members are deliberately limited by the production 
35 conditions to particular chemical structures. For example, 
if they are oligomers, they may be limited in length and 
monomer composition, e.g. hexapeptides composed of the 
twenty genetically encoded amino acids. 
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Hybridization Library 

In a hybridization library, the library meinbers are 
nucleic acids, and are screened using a nucleic acid 
hybridization probe. Bound nucleic acids may then be' 
5 sequenced. 

Expression Library 

In an expression library, the screened library members 
are gene expression products, but one may also speak of an 

10 underlying library of genes encoding those products. The 
library is made by subcloning DNA encoding the library 
members (or portions thereof) into expression vectors (or 
into cloning vectors which subsequently are used to 
construct expression vectors) , each vector comprising an 

15 expressible gene encoding a particular library member, 

introducing the expression vectors into suitable cells, and 
expressing the genes so the expression products are 
produced. 

In one embodiment, the expression products are 

20 secreted, so the library can be screened using an affinity 
reagent, such as an antibody or receptor. The bound 
expression products may be sequenced directly, or their 
sequences inferred by, e.g., sequencing at least the 
variable portion of the encoding DNA. 

25 In a second embodiment, the cells are lysed, thereby 

exposing the expression products, and the latter are 
screened with the affinity reagent. 

In a third embodiment, the cells express the library 
members in such a manner that they are displayed on the 

30 surface of the cells, or on the surface of viral particles 
produced by the cells. (See display libraries, below). 

In a fourth embodiment, the screening is not for the 
ability of the expression product to bind to an affinity 
reagent, but rather for its ability to alter the phenotype 

35 of the host cell in a particular detectable manner. Here, 
the screened library members are transformed cells, but 
there is a first underlying library of expression products 
which mediate the behavior of the cells, and a second 
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underlying library of genes which encode those products. 

Display JLibra^ry 

In a display library /. the library members are each 
5 conjugated to, and displayed upon, a support of some kind. 
The support may be living (a cell or virus), or nonliving 
(e.g.*, a bead or plate) • 

If the support is a cell or virus, display will 
normally be effectuated by expressing a fusion protein which 

10 comprises .the library member, a carrier moiety allowing 

integration of the fusion protein. into the surface of the 
cell or virus, and optionally a lining moiety. In a 
variation on this theme, the cell coexpresses a first fusion 
comprising the library member and. a linking moiety Ll, and a 

15 second fusion comprising a linking moiety L2 and the carrier 
moiety. LI and L2 interact to associate the first fusion 
with the second fusion and hence, indirectly, the library 
member with the surface of the cell or virus. 

20 Soluble Library 

In a soluble library, the library members are free in 
solution. A soluble library may be produced directly, or 
one may first make a display library and then release the 
library members from their supports . 

25 

Encapsulated Library 

In an encapsulated library, the library members are 
inside cells or liposomes. Generally speaking, encapsulated 
libraries are used to store the library members for future 
30 use; the members are extracted in some way for screening 
purposes. However, if they differentially affect the 
phenotype of the cells, they may be screened indirectly by 
screening the cells. 



35 cDNA Library 

A cDNA library is usually prepared by extracting RNA 
from cells of particular origin, fractionating the RNA to 
isolate the messenger RNA (mRNA has a poly (A) tail, so this 
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is usually done by oligo-dT affinity chromatography) , 
synthesizing complementary DNA (cDNA) using reverse 
transcriptase, DNA polymerase/ and other enzymes, subcloning 
the cDNA into vectors, and introducing the vectors into 
5 cells. Often, only mRNAs or cDNAs of particular sizes will 
be. used, to make it more likely that the cDNA encodes a • 
functional polypeptide. 

A cDNA library explores the natural diversity of the 
transcribed DNAs of cells from a particular source. It is 
10 not a combinatorial library. 

A cDNA library may be used to make a hybridization 
library, or it may be used as an (or to make) expression, 
library. 

Genomic DNA Librarv 

A genomic DNA library is made by extracting DNA from a 
particular source/ fragmenting the DNA, isolating fragments 
of a particular size range, subcloning the DNA fragments 
into vectors, and introducing the vectors into cells. 

Like a cDNA library, a genomic DNA library is a natural 
diversity library, and not a condDinatorial library. A 
genomic DNA library may be used the same way as a cDNA 
library. 

25 synthetic DNA librarv 

A synthetic DNA library may be screened directly (as a 
hybridization library) , or used in the creation of an 
expression or display library of peptides/proteins . 

Combinatorial Libraries 

The term "combinatorial library" refers to a library in 
which the individual members are either systematic or random 
combinations of a limited set of basic elements, the 
properties of each member being dependent on the choice and 
location of the elements incorporated into it. Typically, 
the members of the library are at least capable of being 
screened simultaneously. Randomization may be complete or 
partial; some positions may be randomized and others 
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predetermined, and at random positions, the choices may be 
limited in a predetermined manner. The members of a 
' combinatorial library may be oligomers or polymers of some 
kind, in which the variation, occurs through the choice of 
5 monomeric building block at one or more positions of the 

oligomer or polymer, and possibly in terms of the connecting 
linkage, or the length of the oligomer or polymer, too. Or 
the members may be nonoligomeric molecules with a standard 
core structure, like the 1, 4-benzodiazepine structure, with 

10 the variation being introduced by the choice of substituents 
at particular variable sites on the core structure. Or the 
members may be nonoligomeric molecules assembled* like a 
jigsaw puzzle, but wherein each piece has both one or more 
variable moieties (contributing to library diversity) and 

15 one or more constant moieties (providing the functionalities 
for coupling the piece. in question to other pieces) . 

Thus, in a typical combinatorial library, chemical 
building blocks are at least partially randomly combined 
into a large number (as high as 10E15) of different 

20 compounds, which are then simultaneously screened for 

binding (or other) activity against one or more targets. 

In a "simple combinatorial library", all of the members 
belong to the same class of compounds (e.g., peptides) and 
can be synthesized simultaneously. A "composite 

25 combinatorial library" is a mixture of two or more simple 
libraries, e.g., DNAs and peptides, or peptides, peptoids, 
and PNAs, or benzodiazepines and carbamates. The number of 
component simple libraries in a composite library will, of 
course, normally be smaller than the average number of 

30 members in each simple library, as otherwise the advantage 
of a library over individual synthesis is small. 

Libraries of thousands, even millions, of random 
oligopeptides have been prepared by chemical synthesis 
(Houghten et aJ., Nature, 354:84-6(1991)), or gene 

35 expression (Marks et aJ-, J Mol Biol, 222:581-97(1991)), 

displayed on chromatographic supports (Lam et ai.. Nature, 
354:82-4(1991)), inside bacterial cells (Colas et al.. 
Nature, 380:548-550(1996)), on bacterial pili (Lu, 
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Bio/Technology, 13:366-372(1990))^ or phage (Smith, Science, 
228:1315-7(1985)), and screened for binding to a variety of 
targets including antibodies (Valadon et al., J Mol Biol, 
261:11-22(1996)), cellular proteins (Schmitz et al., J Mol 
5 Biol, 260:664-677(1996)), viral proteins (Hong and 

Boulanger, Embo J, 14:4714-4727(1995)), bacterial proteins 
(Jacobsson and Frykberg, ■ Biotechniques, 18:878-885(1995)), 
nucleic acids (Cheng et al.. Gene, 171:1-8(1996)), and 
plastic (Siani et al., J Chem Inf Comput Sci, 34:588- 

10 593(1994) ) . 

Libraries of proteins (Ladner, USP 4, 664, 989) , • peptoids 
(Simon et al., Proc Natl Acad Sci USA, 89:9367-71(1992)), 
nucleic acids (Ellington and Szostak, Nature, 
246:818(1990)), carbohydrates, and small organic molecules 

15 (Eichler et al., Med Res Rev, 15:481-96(1995)) have also 
been prepared or suggested for drug screening purposes. 

The first combinatorial libraries were composed of 
peptides or proteins, in which all or selected amino acid 
positions were randomized. * Peptides and proteins can exhibit 

20 high and specific binding activity, and can act as* 

catalysts. In consequence, they are of great importance in 
biological systems . 

Nucleic acids have also been used in combinatorial 
libraries. Their great advantage is the ease with which a 

25 nucleic acid with appropriate binding activity can be 

amplified. As a result, combinatorial libraries composed of 
nucleic acids can be of low redundancy and hence, of high 
diversity - 

There has also been much interest in combinatorial 
30 libraries based on small molecules, which are more suited to 
pharmaceutical use, especially those which, like 
benzodiazepines, belong to a chemical class which has 
already yielded useful pharmacological agents. The 
techniques of combinatorial chemistry have been recognized 
35 as the most efficient means for finding small molecules that 
act on these targets. At present, small molecule 
combinatorial chemistry involves the synthesis of either 
pooled or discrete molecules that present varying arrays of 
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functionality on a common scaffold. These compounds are 
grouped in libraries that are then screened against the 
target of interest either for binding or for inhibition of 
biological activity. 
5 The size of a library is the number of molecules in it. 

The simple diversity of a library is the number of unique 
structures in it. There is no formal minimum or maximxam 
diversity. If the library has a very low diversity, the 
library has little advantage over just synthesizing and 

10 screening the members individually. If the library is of 
very high diversity, it may be inconvenient to handle, at 
least without automatizing the process. The simple 
diversity of a library is preferably at least 10, 10E2, 
10E3, 10E4, 10E6, 10E7, 10E8 or 10E9/ the higher the better 

15 under most circiunstances . The simple diversity is usually 
not more than 10E15, and more usually not more- than lOElO. 

The average sampling level is the size divided by the 
simple diversity. The expected average sampling level must 
be high enough to provide a reasonable assurance that/ if a. 

20 given structure were expected, as a consequence of the 
library design, to be present, that the actual average 
sampling level will be high enough so that the structure, if 
satisfying the screening criteria, will yield a positive 
result when the library is screened. Thus, the preferred 

25 average sampling level is a function of the detection limit, 
which in turn is a function of the strength of the signal to 
be. screened. 

There are more complex measures of diversity than 
simple diversity. These attempt to take into account the 

30 degree of structural difference between the various unique 
sequences- These more complex measures are usually used in 
the context of small organic compound libraries, see below. 

The library members may be presented as solutes in 
solution, or immobilized on some form of support. In the 

35 latter case, the support may be living (cell, virus) or 

nonliving (bead, plate, etc.). The supports may be separable 
(cells, virus particles, beads) so that binding and 
nonbinding members can be separated, or nonseparable 
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(plate) . In the latter case, the members will normally be 
placed on addressable positions on the support. The 
advantage- of a soluble library is that there is no carrier 
moiety that could interfere with the binding of the members 
5 .to the support. The advantage of an immobilized library is 
that it is easier to identify the structure of the members 
which were positive. 

When screening a soluble library, or one with a 
separable support, the target is usually immobilized. When 
10 • screening a library on a nonseparable support, the target 
will usually be labeled. 

Oligonucleotide Libraries 

An oligonucleotide library is .a combinatorial library, 

15 at least some of whose members are single-stranded 

oligonucleotides having three or more nucleotides connected 
by phosphodiester or analogous bonds. The oligonucleotides 
may be linear, cyclic or branched, and may include non- 
nucleic acid moieties . The nucleotides are not limited to 

20 the nucleotides normally found in DNA or RNA.- For examples . 
. of nucleotides modified to increase nuclease resistance and 
chemical stability of aptamers, see Chart 1 in Osborne and 
Ellington, Chem. Rev., 97: 349-70 (1997). For screening of 
RNA, see Ellington and Szostak, Nature, 346: 818-22 (1990) . 

25 There is no formal minimum or maximum size for these 

oligonucleotides. However, the nvimber of conformations which 
an oligonucleotide can assume increases exponentially with 
its length in bases- Hence, a longer oligonucleotide is 
more likely to be able to fold to adapt itself to a protein 

30 surface. On the other hand, while very long molecules can 
be synthesized and screened, unless they provide a much 
superior affinity to that of shorter molecules, they are not 
likely to be found in the selected population, for the 
reasons explained by Osborne and Ellington (1997) . Hence, 

35 the libraries of the present invention are preferably 

composed of oligonucleotides having a length of 3 to 100 
bases, more preferably 15 to 35 bases- The oligonucleotides 
in a given library may be of the same or of different 
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lengths . 

Oligonucleotide libraries have the advantage that 
libraries of very high diversity (e.g., 10") are feasible, 
and binding molecules are readily amplified in vitro by 
5 polymerase chain reaction (PGR) . Moreover, nucleic acid 
molecules can have very high specificity and affinity to 
targets . 

In a preferred embodiment, this invention -prepares and 
screens oligonucleotide libraries. by the SELEX method, as 
10 described in King and Famulok, Molec. Biol. Repts., 20: 97- 
107 (1994); L. Gold, C. Tuerk. Methods of producing nucleic 
acid ligands, US#5595877; Oliphant et al. Gene 44:177 
(1986) . 

The terra "aptamer" is conferred on those 
.15 oligonucleotides which bind the target protein. Such 

aptamers may bemused to characterize the target protein, 
both directly (through identification of the aptamer and the 
points of contact between the aptamer and the protein) and 
indirectly (by use of the aptamer as a ligand to modify the 
20 chemical reactivity of the protein) . 

In a classic oligonuclotide, each nucleotide (monomeric 
unit) is composed of a phosphate group, a sugar moiety, and 
either a purine or a pyrimidine base. In DNA, the sugar is 
deoxyribose and in RNA it is ribose. The nucleotides are 
25 linked by 5 '-3* phosphodiester bonds. 

The deoxyribose phosphate backbone of DNA can be 
modified to increase resistance to nuclease and to increase 
penetration of cell membranes. Derivatives such as mono- or 
dithiophosphates, methyl phosphonates, boranophospHates, 
30 formacetals, carbamates, siloxanes, and dimethylenethio- -* 
sulfoxideo- and-sulfono- linked species are known in the 
art. 

Peptide Library 

35 A peptide is composed of a plurality of amino acid 

residues joined together by peptidyl (-NHCO-) bonds. A 
biogenic peptide is a peptide in which the residues are all 
genetically encoded amino acid residues; it is not necessary 
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that the biogenic peptide actually be produced by gene 
expression. 

Amino acids are the basic building blocks with which 
peptides and proteins are constructed. Amino acids possess 
• 5 both an amino group (-NH2) and a carboxylic acid group (- 

COOH) . Many amino acids, but not all, have the alpha amino 
acid structure NHg-CHR-COOH/ where R is hydrogen, or any of c 
variety of functional groups. 

Twenty amino acids are genetically encoded: Alanine, 

10 Arginine, Asparagine, Aspartic Acid, Cysteine, Glutamic 

Acid, Glutamine, Glycine, Histidine, Isoleucine, Leucine, 
Lysine, Methionine, Phenylalanine, Proline, Serine, 
Threonine, Tryptophan, Tyrosine, and Valine. Of these, all 
save Glycine are optically isomeric, however, only. the L- 

15 form is found in humans. Nevertheless, the D-forms of these 
amino acids do have biological significance; D-Phe, for 
example, is a known analgesic. • 

. Many other amino acids are also known, including: 2- 
Aminoadipic acid; 3-Aminoadipic acid; beta-Aminopropionic 

20 acid; 2-Aminobutyric acid; 4-Aminobutyric acid (Piperidinic 
acid) ; 6-Aminocaproic acid; 2-Aminoheptanoic acid; 2- 
Aminoisobutyric acid, 3-Aminoisobutyric acid; 2'-Aminopimelic 
acid; 2, 4-Diaminobutyric acid; Desmosine; 2,2'- 
Diaminopimelic acid; 2, 3-Diaminopropionic acid; N- 

25 Ethylglycine; N-Ethylasparaginer Hydroxylysine; alio- 
• Hydroxylysine; 3-Hydroxyproline; 4-Hydroxyproline; 

Isodesmosine; allo-Isoleucine; N-Methylglycine (Sarcosine) ; 
N-Methylisoleucine; N-Methyl valine; Norvaline; Norleucine; 
and Ornithine - 

30 Peptides are constructed by condensation of amino acids 

and/or smaller peptides. The amino group of one amino acid 
(or peptide) reacts with the carboxylic acid group of a 
second amino acid (or peptide) to form a peptide (-NHCO-) 
bond, releasing one molecule of water. Therefore, when an 

35 amino acid is incorporated into a peptide, it should, 
technically speaking, be referred to as an amino acid 
residue . The core of that residue is the moiety which 
excludes the -NH and -CO linking functionalities which 
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connect it to other residues. This moiety consists of one 
or more main chain atoms (see below) and the attached side 
chains . 

The main chain moiety of each amino acid consists of 
5 the -NH and -CO Linking functionalities and a core main 

•chain moiety. Usually the latter is a single carbon atom. 
However, the core main chain moiety may include additional 
carbon atoms, and may also include nitrogen, oxygen or 
•sulfur atoms, which together form. a single chain. In a 

10 preferred embodiment, the core main. chain atoms consist 
solely of carbon atoms . 

The side chains are attached to the core main chain 
atoms. For alpha amino acids, in which the side chain is 
attached to the -alpha carbon, the C-1, C-2 and N-2 of each 

15 residue form the repeating unit of the main chain, and the 
word ^'side chain'' refers to the C-3 and higher numbered 
carbon atoms and their substituents . It also includes H 
atoms attached to the main chain atoms . 

Amino acids may be classified according to the number 

20 of carbon atoms which appear in the main chain between the 
carbonyl carbon and amino nitrogen atoms which participate 
in the peptide bonds. Among the 150 or so amino acids which 
occur in nature, alpha, beta, gamma and delta amino acids 
are known. These have 1-4 intermediary carbons. Only alpha 

25 amino acids occur in proteins. Proline is a special case of 
an alpha amino acid; its side chain also binds to the 
peptide bond nitrogen. : 

For beta and higher order amino acids, there is a 
choice as to which main chain core carbon a side chain other 

30 than H is attached to- The preferred attachment site is the 
C-2 (alpha) carbon, i.e,, the one adjacent to the carboxyl 
carbon of the -CO linking functionality. It is also possible 
for more than one main chain atom to carry a side chain 
other than H- However, in a preferrred embodiment, only one 

35 main chain core atom carries a side chain other than H. 

A main chain carbon atom may carry either one or two 
side chains; one is more common. A side chain may be 
attached to a main chain carbon atom by a single or a double 
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bond; the former is more common. 

A simple combinatorial peptide library is one whose 
members are peptides having three or more amino acids 
connected via peptide bonds. 
5 The peptides may be linear, branched/ or cyclic, and " 

may covalently or noncovalently include nonpeptidyl 
moieties. The amino acids are not limited to the naturally 
occurring or to the genetically encoded amino acids. 

A biased peptide library is one in which one or more 
10 (but not all)- residues- of the peptides are constant 
residues • 

Cyclic Peptides 

Many naturally occurring peptides are cyclic. 

15 Cyclization is a common mechanism for stabilization of 

peptide .conformation thereby achieving improved association 
of the peptide with its ligand and hence improved biological 
activity* Cyclization is usually achieved by intra-chain 
cystine formation, by formation of peptide bond between side 

20 chains or between N- and C- terminals. Cyclization was 
usually achieved by peptides in solution, but several 
publications have appeared that describe cyclization of 
peptides on beads . * . 

A peptide library may be an oligopeptide library or a 

25 protein library. 

Ol i gopep tides 

Preferably, the oligopeptides are at least five, six, 
seven or eight amino acids in length. Preferably, .they are 
30 composed of less than 50, more preferably less than 20 amino 
acids. 

In the case of an oligopeptide library, all or just 
some of the residues may be variable. The oligopeptide may 
be unconstrained, or constrained to a particular 
35 conformation by, e.g., the participation of constant 
cysteine residues in the formation of a constraining 
disulfide bond. 
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ProtBins 

Proteins / like oligopeptides, are composed of a 
plurality of amino acids, but the term protein is usually 
reserved for longer peptides, which are able to fold into a . 
5 stable conformation. A protein may be composed of two or 
more polypeptide chains, held together by covalent or 
noncovaleni: crosslinks. 

A peptide is considered a protein if it (1) is at least 
50. amino acids long, or (2) has at least two stabilizing 
10 covalent crosslinks (e.g., disulfide bonds). Thus, 
conotoxins are considered proteins . 

Usually, the proteins of a protein library will be 
characterizable as having both constant residues (the same 
for all proteins in the library) and variable residues 
15 (which vary from member to member) . This is simply because, 
for a given range of variation at each position, the 
sequence space (simple diversity) grows exponentially with 
the number of residue positions, so at some point it becomes 
inconvenient for all residues of a peptide to be variable 
20 positions. Since proteins are usually larger than 

oligopeptides, it is more common for protein libraries than 
oligopeptide libraries to feature variable positions. 

In the case of a protein library, it is desirable to 
focus the mutations at those sites which are tolerant of 
25 mutation. These may be determined by alanine scanning 

mutagenesis or by comparison of the protein sequence to that 
of homologous proteins of similar activity. It is also more 
likely that mutation of surface residues will directly 
affect binding. Surface residues may be determined by 
30 inspecting a 3D structure of the protein, or by labeling the 
surface and then ascertaining which residues have received 
labels. They may also be inferred by identifying regions of 
high hydrophilicity within the protein. 

Because proteins are often altered at some sites but 
35 not others, protein libraries can be considered a special 
case of the biased peptide library. 

There are several reasons that one might screen a 
protein library Instead of an oligopeptide library. 
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including (1) a particular protein/ mutated in the library, 
has the desired activity to some degree already, and (2) the 
oligopeptides are not expected to have a sufficiently high 
affinity or specificity since they do not have a stable 
5 conf ormation. 

When the protein library is based on a parental prptein 
which does not have the desired activity, the parental 
protein will usually' be one which is of high stability 
(melting point >= 50 deg. C.) and/or possessed of 

10 hypervariable regions . • . • 

The variable domains of an antibody possess 
hypervariable regions and hence, in some embodiments, the 
protein library comprises members which comprise- a mutant of 
VH or VL chain, or a mutant of an antigen-specific binding 

15 fragment of such a chain. VH and VL chains are usually each 
about 110 amino acid residues, and are held in proximity by • 
a disulfide bond between the adjoing CL and CHI regions to 
form a variable domain. Together, the VH, VL, CL and CHI 
form an Fab fragment . 

20 In human heavy chains, thfe hypervariable regions are at 

31-35, 49-65, 98-111 and 84-88, but only the first three are 
involved in antigen binding. There is variation among VH 
and VL chains at residues outside the hypervariable regions, 
but to a much lesser degree. 

25 A sequence is considered a mutant of a VH or VL chain 

if it is at least 80% identical to a naturally occurring VH 
or VL chain at all residues outside the hypervariable 
region. 

In a preferred embodiment, such antibody library 
30 members comprise both at least one VH chain and at least one 
VL chain, at least one of which is- a mutant chain, and which 
chains may be derived from the same or different antibodies. 
The VH and VL chains may be covalently joined by a suitable 
linker moiety, as in a ^^single chain antibody"^, or they may 
35 be noncovalently joined, as in a naturally occurring 
variable domain. 

If the joining is noncovalent, and the library is • 
displayed on cells or virus, then either the VH or the VL 
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chain may be fused to the carrier surface/coat protein • The 
complementary chain may be co-expressed, or added 
exogenously to the library. 

The members may further comprise some or all of an 
antibody constant heavy and/or constant light chain, or a 
mutant thereof. 

Peptoid Library 

A peptoid is an analogue of a peptide in which one or 
more, of the peptide bonds (-NH-CO-) are replaced by 
pseudopeptide bonds,. which may be the same* or different. It 
is not necessary that all of the peptide bonds be replaced, 
i.e., a peptoid may include one or moxe conventional amino 
acid residues,* e.g., proline. 

A peptide bond' has two small divalent linker elements, 
-NH- and -CO-. Thus, a preferred class of psixedopeptide 
bonds are those which consist of two small divalent linker * 
elements. Each may be chosen independently from the group 
consisting of amine (-NH-) , substituted amine (-NR-) , 
carbonyl (-CO-), thiocarbonyl {-CS-) , methylene (-CH2-) , 
monosubstituted methylene (-CHR-) , disubstituted methylene 
(-CR1R2-), ether {-0-) and thioether (-S-) • The more 
preferred pseudopeptide bonds inclu.de: 

N-modified -NRCO- 

Carba ^ -CHj-CHj- 

Depsi W -CO-0- 

Hydroxyethylene W -CHOH-CHj- 

Ketomethylene W -CO-CHg- 

Methylene-Oxy -CHg-O- 

Reduced -CH2-NH- 

Thiomethylene -CHg-S- 

Thiopeptide -CS-NH- 

Retro-Inverso -CO-NH- 

A single peptoid molecule may include more than one 
kind of pseudopeptide bond. 

For the purposes of introducing diversity into a * 
peptoid library, one may vary (1) the side chains attached 
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to the core main chain atoms of the monomers linked by the 
pseudopeptide bonds ^ and/or (2) the side chains (e.g., the - 
R of an -NRCO-) of the pseudopeptide bonds. Thus, in one 
embodiment, the monomeric units \^hich are not amino acid 
5 residues are of the structure -NR1-CR2-CO-, where at least 
one of Rl and R2 are not hydrogen. If there is variability 
in the pseudopeptide bond, this is most conveniently done by 
using an -NRCO- or other pseudopeptide bond with an R group, 
and varying the R group. In thi.s event, the R group will 

10 usually be any of the side chains characterizing the amino 
acids of peptides, as previously discussed. 

If the R group of. the pseudopeptide bond is not 
variable, it will usually be small, e.g», not more than 10 
atoms (e.g., hydroxyl, amino, carboxyl, methyl, ethyl, 

15 propyl) . 

If the conjugation chemistries are compatible, a simple 
combinatorial library may include both peptides and 
peptoids . 

20 Peptide Nucleic Acid Library 

A PNA oligomer is here defined as one comprising a 
plurality of units, at least one of which is a PNA monomer 
which comprises a side chain comprising a nucleobase. For 
nucleobases, see USP 6,077,835. 

25 The classic PNA oligomer is composed of (2- 

aminoethyl) glycine units, with nucleobases attached by 
methylene carbonyl linkers. * That is,* it has the structure 

H- ( -HN-CH2-CH2-N ( -CO-CHjT-B ) -CH2-CO- ) a -OH 

30 

where the outer parenthesized substructure is the PNA 
monomer . 

In this structure, the nucleobase B is separated from 
35 the backbone N by three bonds, and the points of attachment 
of the side chains are separated by six bonds. The 
nucleobase may be any of the bases included in the 
nucleotides discussed in connection with oligonucleotide 
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libraries. The bases of nucleotides A, G, C and U are 
preferred. 

A PNA oligomer may further comprise one or more amin'o 
acidl residues, especially glycine and proline. 
5 One can readily envision related molecules in which (1) 

the -C0CH2- linker is replaced by another linker, especially 
one composed of two small divalent linkers as defined 
previously, (2) a side chain is attached to one of the three . 
• main chain carbons not participating in the peptide bond 
10 (either instead or in addition .to the. side chain attached to 
the N of the classic PNA) ; and/or (3) the peptide bonds are 
replaced by pseudopeptide bonds as disclosed previously in 
the context of peptoids. 

PNA oligomer libraries have been made; see e.g. Cook, 
15 6,204,326. 

Small Organic Compound Librarv 

The small organic compound library ("compound library", 
for short) is a combinatorial library whose members are 

20 suitable for use as drugs if, indeed, they have the ability 
to mediate a biological activity of the target protein. 

Peptides have certain disadvantages as drugs- These 
include susceptibility to degradation by serum proteases, 
and difficulty in penetrating cell membranes. Preferably, 

25 all or most of the compounds of the compound library avoid, 
or at least do not suffer to the same degree, one or more of 
the pharmaceutical disadvantages of peptides. 

In designing a compound library, it is helpful to bear 
in mind the methods of molecular modification typically used 

30 to obtain new drugs. Three basic kinds of modification may 
be identified: disjunction , in which a lead drug is 
simplified to identify its component pharmacophoric 
moieties; conjunction ^ in which two or more known 
pharmacophoric moieties, which may be the same or different, 

35 are associated, covalently or noncovalently, to form a new 
drug; and alteration , in which one moiety is replaced by 
another which may be similar or different, but which is not 
in effect a disjunction or conjunction. The use of the 
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terms "disjunction", "conjunction" and "alteration" is 
intended only to connote the structural relationship of the 
end product to the original leads, and not how the new drugs 
are actually synthesized, although it is possible that the 
5 two are the same. 

The process of disjunction is illustrated by the 
evolution of neostigmine (1931) and edrophonium (1952) from 
physostigmine (1925) . Subsequent conjunction is illustrated 
•by demecarixim. (1956) and ambenonium (1956) . 

10, • Alterations may modify the size, polarity, or electron 

distribution of an original moiety. Alterations include 
ring closing or opening, formation of lower or higher 
homologues, introduction or saturation of double bonds, 
introduction of optically active * centers , introduction, 

15 removal or replacement of bulky groups, isosteric or 
bioisosteric substitution, changes in the. position or 
orientation of a group, introduction of alkylating groups, 
and introduction, removal or replacement of groups with a 
view toward inhibiting or promoting inductive 

20 (electrostatic) or conjugative (xesonance) effects. 

Thus, the substituents may include electron acceptors 
and/or electron donors. Typical electron donors (+1) 
include -CH3, -CHgR, -CHR2, -CR3 and -COO". Typical electron 
acceptors (-1) include -NH3+, -NR3+, -NO2, -ON, -COOH, -COOR, 

25 -CHO, -COR, -COR, -F, -CI, -Br, -OH, -OR, -SH, -SR, -CH=CH2, 
-CR==CR2 , and -C=CH . 

The substituents may also include those which increase 
or decrease electronic density in conjugated systems. The 
former (+R) groups include -CH3, -CR3, -F,*-C1, -Br, -I, -OH, 

30 -OR, -OCOR, -SH, -SR, -NH2, -NRj, and -NHCOR. The later (-R) 
groups include -NOg, -CN, -CHC, -COR, -COOH, -COOR, -CONHj, 
-SOjR and -CF3. 

Synthetically speaking, the modifications may be 
achieved by a variety of unit processes, including 

i5 nucleophilic and electrophilic substitution, reduction and 
oxidation, addition elimination, double bond cleavage, and 
cyclization. 

For the purpose of constructing a library, a compound. 
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or a family of compounds, having one or more pharmacological 
activities (which need not be related to the known or 
. suspected activities of the target protein) , may be 
disjoined into two or more .known or potential pharmacophoric 
5 moieties. Analogues of each of these moieties may be 

identified, and mixtures of these analogues reacted so as to 
reassemble compounds which have some similarity to the 
original lead compound. It is not necessary that all 
members of the library possess moieties analogous to all of 

10* the moieties of the lead compound. 

The design* of a library may be illustrated by the 
example of the benzodiazepines. Several benzodiazepine 
drugs, including chlordiazepoxide, diazepam and oxazepam, 
have been used as anti-anxiety drugs. Derivatives of 

15 benzodiazepines have widespread biological activities; 
derivatives have been reported to act not only as 
anxiolytics, but also as anticonvulsants; cholecystokinin 
(CCK) receptor subtype A or B, kappa opioid receptor, 
platelet activating factor, and HIV transactivator Tat 

20 antagonists, and GPIIblla, reverse transcriptase and ras 
. farnesyltransf erase inhibitors . 

The benzodiazepine structure has been disjoined into a 
2-aminobenzophenone, an amino acid, and an alkylating agent. 
See Bunin, et al., Prpc. Nat. Acad. Sci. USA, 91:4708 

25 (1994) - Since only a few 2-aminobenzophenone derivatives 
are commercially available, it was later disjoined into 2- 
aminoarylstannane, an acid chloride, an amino acid, and an 
alkylating agent, Bunin, et al., Meth. Enzymol., 267:448 
(1996) . The arylstannane may be considered the core 

30 structure upon which the other moieties are substituted, or 
all four may be considered equals which are conjoined to 
make each library member, 

A basic library synthesis plan and member structure is 
shown in Figure 1 of Fowlkes, et al., U.S. Serial No. 

35 08/740,671, incorporated by reference in its entirety.- The 
acid chloride building block introduces variability at the 
site. The site is introduced by the amino acid, and the 
R^ site by the alkylating agent. The R* site is inherent in 
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the arylstannane. Bunin, et al. generated a 1, 4- 
benzodiazepine library of 11,200 different derivatives 
prepared from 20 acid chlorides, 35 amino acids, and 16 
alkylating agents. (No diversity was introduced at R*; this 
group was used to couple the molecule to a solid phase.) 
According to the Available Chemicals Directory (HDL 
Information Systems, San Leandro CA) , over ,300 acid 
chlorides, 80 Fmoc-protected amino acids and 800 alkylating 
agents were available for purchase (and more, of course, 
could be synthesized) . The particular moieties used were 
chosen to maximize structural dispersion, while limiting the 
numbers to those conveniently synthesized in the wells of a 
microtiter plate. In choosing between structurally similar 
compounds, preference was given to the least substituted 
compound . 

The variable elements included both aliphatic and 
aromatic groups. Among the aliphatic groups, both acyclic 
and cyclic (mono- or poly-) structures, substituted or not, 
were tested. (While all of the acyclic groups were linear, 
it would have been feasible to introduce a branched 
aliphatic) . The aromatic groups featured either single and 
multiple rings, fused or not, substituted or not, and with 
heteroatdms or not. The secondary substitutents included - 
NH2, -OH, -OMe, -CN, -CI, -F, and -COOH. While not used, 
spacer moieties, such as -S-, -00-, -CS-, -NH-, and - 

NR-, could have been incorporated. 

Bunin et al. suggest that instead of using a 1, 4- 
benzodiazepine as a core structure, one may instead use a 1, 
4-benzodiazepine-2, 5-dione structure. 

As noted by Bunin et al., it is advantageous, although 
'not necessary, to use a linkage strategy which leaves no 
trace of the linking functionality, as this permits 
construction of a more diverse library. 

Other combinatorial nonoligomeric compound libraries 
known or suggested in the art have been based on carbamates, 
mercaptoacylated pyrrolidines, phenolic agents, aminimides, 
N-acylamino ethers (made from amino alcohols, aromatic 
hydroxy acids, and carboxylic acids), N-alkylamino ethers 
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(made from aromatic hydroxy acids, amino alcohols and 
aldehydes) 1, 4-piperazines> and 1, 4-piperazine-6-ones . 

DeWitt/ et al., Proc. Nat. Acad. Sci. (USA), 90:6909^-13 
(1953) describe the simultaneous but separate, synthesis of 
40 discrete hydantoins and 40 discrete benzodiazepines. 
They carry out their synthesis on a solid support (inside a 
gas dispersion tube) , in an array format, as opposed to 
other conventional simultaneous synthesis techniques (e.g., 
in a well, or on a pin) . The hydantoins were synthesized by 
first simultaneously deprotecting and then treating each of 
five amino acid resins with each of eight isocyanates. The 
benzodiazepines were synthesized by treating each of five 
deprotected amino acid resins with each of eight 2-amino 
benzophenone imines . 

Chen, et al., J. Am. Chem. See, 116:2661-62 (1994) 
described the preparation of a pilot (9 member) 
combinatorial library of formate esters. A polymer bead- 
bound aldehyde preparation was "split" into three aliquots, 
each reacted with one of three different ylide reagents. 
The reaction products were combined, and then divided into 
three new aliquots, each of which was reacted with a 
different Michael donor. Compound identity was found to be 
determinable on a single bead basis by gas 
chromatography/mass spectroscopy analysis. 

Holmes,. USP 5,549,974 (1996) sets forth methodologies 
for the combinatorial synthesis of libraries of 
thiazolidinones and metathiazanones . These libraries are 
made by combination of amines, carbonyl compounds, and 
thiols under cyclization conditions. 

Ellman, USP 5,545,568 (1996) describes combinatorial 
synthesis of benzodiazepines, prostaglandins, beta-turn 
mimetics, and glycerol-based compounds. See also Ellman, 
USP 5,288,514. 

Summerton, USP 5,506,337 (1996) discloses methods of 
preparing a combinatorial library formed predominantly of 
morpholino subunit structures . 

Heterocylic combinatorial libraries are reviewed 
generally in Nefzi, et al., Chem. Rev., 97:449-472 (1997). 
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One or more moieties of the following types may be 
incorporated into compounds of the library, as many drugs 
fall into one or more of the following categories: 

acetals 

acids 

alcohols 

amides 

amidines 

amines 

amino acids 

amino alcohols 

amino ethers 

amino ketenes 

ammonium compounds 

azo compounds 

enols 

esters 

ethers 

glycosides 

guanidines 

halogenated compounds 

hydrocarbons 

ketones 

lactams 

lactones 

mustards 

nitro compounds 

nitroso compounds 

organo minerals 
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phenones • 

quinones 

semlcarbazones 

stilbenes 

sulfonamides 

sulfones 

thiols 

thioamides 

thioureas 

ureas 

ureides 

urethans 

Without attempting to exhaustively recite all 
pharmacological classes of drugs, or all drug structures, 
one or more compounds of the chemical structures listed 
below have been found to exhibit the indicated 
pharmacological activity, and these structures, or 
derivatives, may be used as design elements in screening ; 
further compounds of the same or different activity. (In 
some cases, one or more lead drugs of the class are 
indicated. ) 

hypnotics 

higher alcohols (clomethiazole) 

aldehydes (chloral hydrate) 

carbamates (meprobamate) 

acyclic ureides (acetyicarbromal) 

barbiturates (barbital) 

benzodiazepine (diazepam) 

an t icon vul s a n t s 

barbiturates (phenobarbital) 
hydantoins (phenytoin) 
oxazolidinediones (trimethadione) 
succinimides (phensuximide ) 
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acylureides (phenacemides) 

narcotic analgesics 
morphines 

phenylpiper idines (meperidine ) 
diphenylpropyl amines (methadone) 
phenothiazihes (methotrimeprazine) 

analgesics, antipyretics, antirheumatics 
salicylates (acetylsalicylic acid) 
p-aminophenol (acetaminophen) 
5-pyrazolone (dipyrone) 

3/ 5-pyrazolidinedione (phenylbutazone) 
arylacetic acid (indomethacin) 

adrenocortical steroids (cortisone, dexamethasone, 
prednisone, triamcilone) 
athranilic . acids 

neuroleptics 

phenothiazine (chlorpromazine) 
thioxanthene ( chlorprothixene ) 
reserpine 

butyrophenone (halopendol ) 

anxiolytics 

propandiol carbamates (meprobamate) 
benzodiazepines ' (chlordiazepoxide, diazepam, 

oxazepam) 

antidipres s ants 

tricyclics ( imipramine ) 

muscle/relaxants 

propanediols and carbamates (mephenesin) 

CNS stimulants 

xanthines (caffeine, theophylline) 
phenylalkylamines (amphetamine) 
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(Fenetylline is a conjunction of theophylline and 
amphetamine) 

oxazolidinones (pemoline) 
cholinergics 

choline esters (acetylcholine) 
N,N-dimethylcarbamates 

adrenergics 

aromatic amines (epinephrine,, isoproterenol, 

phenylephrine) 
alicyclic amines (cyclopent amine) 
aliphatic amines (methylhexaneamine) 
imidazolines (naphazoline) 

anti-adrenergics 

indole thyl amine alkaloids (dihydroergotamine) 

imidazoles (tolazoline) 

benzodioxans (piperoxan) 

beta-haloalkylamines (phenoxybenzamine) 

dibenzazepines (azapetine) 

hydra zinophtha 1 a z ine s ( hy dr al a zine ) 

antihistamines 

ethanolamine s ( diphenhydramine ) 
ethylenediamines (tripelennomine) 
alkylamines (chlorpheniramine) 
piperazines (cyclizine) 
phenothiazines (promethazine) 

local anesthetics 
benzoic acid 

esters (procaine, isobucaine, cyclomethycaine) 
basic amides (dibucaine) 

anilides, toluidides, 2, 6-xylidides (lidocaine) 
tertiary amides (oxetacaine) 

vasodilators 

polyol nitrates (nitroglycerin) 
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diuretics 

xanthines 

thiazides (chlorothiazide) 
sulfonamides (chlorthalidone) 

5 

antihelmintics 

cyanine dyes 



antimalarials 

10 4*-aminoquinolines 

8-aminoquinolines 
pyrimidines 
biguanides 
acridines 

15 dihydrotriazines 

sulfonamides 
sulfones 



antibacterials 
20 antibiotics 

penicillins 
cephalosporins 

octahydronapthacenes (tetracycline) 
sulfonamides 
25 nitrofurans 

cyclic amines 

naphthyridines 

xylenols 



30 antitumor 

alkylating agents 
nitrogen mustards 
aziridines 

methanesulfonate esters 
35 epoxides 

amino acid antagonists 
folic acid antagonists 
pyrimidine antagonists 
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i antagonists 



• .:tanes 

5 .-sides 

-luicarbazones 

• r.es 

v.es and guanidines 
.nolines 

10 .idazoles 

- .:2ines • 

Tological classes, see, e,g., Goth/ Medical 
-inciples and Concepts (C.V. Mosby Co.: 8th 
15 Lkovas and Burckhalter, Essentials of 

• ; try (John Wiley & Sons, Inc.: 1976). For * 
-.3, see, e.g.. Warren, Organic Synthes is: The 
o oroach (John Wiley & Sons, Ltd.: 1982); 
I ' of Organic Compounds (John Wiley & Sons: 

20 : 5 Payne, How to do an Organic Synthesis 

.1, Inc.: 1969); Greene, Protective Groups in 
C \s, (Wiley-Interscience) . For selection of 

c -.e e.g., Hansch and Leo, Substituent 

C - rrelation Analysis in Chemistry a nd Biology 

25 . .ns: 1979) . 

■f is preferably synthesized so that the 
i • . * .rrs remain identifiable so that, if a member 

i -.ctive, it is not necessary to analyze it. 

£ of identification have been proposed, 

30 i 

ing, i.e., the attachment to each member of 
:ntifier moiety which is more readily 
'.fied than the member proper. This has the 
rantage that the tag may itself influence the 
35 Ity of the conjugate. 

:1 addressing, e.g., each member is 
-.sized only at a particular coordinate on or 
. ...atrix, or in a particular chamber. This 
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might be, for example, the location of a 
particular pin, or a particular well on a 
microtiter plate, or inside a "tea bag". 
The present invention is not limited to any particular form 
5 of identification. 

However, it is possible to simply characterize those 
members of the library which* are found to be active, .based 
on the characteristic spectroscopic indicia of the various 
building blocks. 
10 • Solid phase synthesis permits greater control over • 

which derivatives are formed. However, the solid phase 
could interfere with activity. To overcome this problem, 
jsome or all of the molecules of each member could be 
liberated, after synthesis but before screening. 
15 . Examples of candidate simple libraries which* laight .be 

evaluated include derivatives of the following: 
Cyclic Compounds Containing One Hetero Atom 
Heteronitrogen 
pyrroles 

20 pentasubstituted pyrroles 

pyrrolidines 
pyrrolines 
prolines 
indoles 

25 beta-carbolines 

pyridines 

dihydropyridines 

1, 4-dihydropyridines 

pyrido [ 2 , 3-d] pyrimidines 
30 tetrahydro-3H-imidazo[4, 5-c] pyridines 

Isoquinolines 

tetrahydroisoquinolines 
quinolones 
beta-lactams 

35 azabicyclo(4.3.0]nonen-8-one amino acid 

Heterooxygen 
furans 

tetrahydrofurans 
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2, S-disubstituted tetrahydrofurans 

pyrans 

hydroxypyranones 
t et r ahydroxypy r anone s 
5 gamma -bu t y r ol a ct one s 

Heterosulf ur 

sulfolenes 

Cyclic Compounds with. Two or More Hetero atoms 
Multiple heteronitrogens 
10 imidazoles 

pyrazoles 
piperazines 

diketopiperazines 
arylpiperazines 
15 benzylpiperazines 

benzodiazepines 
1 , 4-benzodiazepine-2 , 5-diones 
hydantoins 

5-alkoxyhydantoins 
20 dihydropyrimi dines 

1, 3-disubstituted-5/ 6-dihydopyrimidine-2^ 4- 

diones 
cyclic ureas 
25 cyclic thioureas 

quinazolines 

chiral 3-substituted-quinazoline-2 , 4- 

diones 

triazoles 

30 1, 2, 3-triazoles 

purines 

Heteronitrogen and Heterooxygen 
dikelomorpholines 
isoxazoles 
35 isoxazolines 

Heteronitrogen and Heterosulfur 
thiazolidines 

N-axylthiazolidines 
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dihydrothiazoles 

2-methylene-2^ S-dihydrothiazates 

2-aminothiazoles 
thiophenes 

5 3 -amino thiophenes 

4-thiazolidinones 
4-inelathiazanones 
benz i s o thi a zolone s 
For details on synthesis. of libraries^ see Nefzi, et 
10 al., Chem. Rev., 97:449-72 (1997.), and references cited 
therein . 

Sharmaceu'blcal Metliods and Preparations 

The preferred animal subject of the present invention 

15 is a mammal. By the term "mammal" is meant an individual • 
belonging to the class Mammalia. The invention is 
particularly useful in the treatment of human subjects, 
although it is intended for veterinary and nutritional uses 
as well. Preferred nonhuman subjects are of the orders 

20 Primata (e.g., apes and monkeys) , Artiodactyla or 

Perissodactyla (e.g., cows, pigs, sheep, horses, goats), 
Carnivora (e.g., cats, dogs), Rodenta (e.g., rats, mice, 
.guinea pigs, hamsters), Lagomorpha (e.g., rabbits) or other 
pet, farm or laborati>ry mammals. 

25 The term "protection", as used herein, is intended to 

include "prevention," "suppression" and "treatment." 
"Prevention", strictly speaking, involves administration of 
the pharmaceutical prior to the induction of the disease, (or 
other adverse clinical condition) . "Suppression" involves 

30 administration of the composition prior t o the clinical 
appearance of the disease. "Treatment" involves 
administration of the protective composition after the, 
appearance of the disease. 

It will be understood that in human and veterinary 

35 medicine, it is not always possible to distinguish between 
"preventing" and "suppressing" since the ultimate inductive 
event or events may be unknown, latent, or the patient is 
not ascertained until well after the occurrence of the event 
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or events. Therefore/ unless qualified/ the term 
"prevention" will be understood to refer to both prevention 
in the strict sense/ and to suppression. 

The preventative or prophylactic use of a 
5 pharmaceutical .involves identifying subjects who are at 

higher risk than the general population of contracting the 
disease/ and administering the pharmaceutical to them in 
advance of the clinica^l appearance of the disease. The 
effectiveness of such use is measured by comparing the ' 
10 • subsequent incidence or severity of the disease, or* of 

particular symptoms of the disease, in the treated subjects 
against that in untreated subjects of the same high risk 
group . 

While high risk factors vary from disease to disease, 

15 in general, these include (1) prior occurrence of the 

disease in one or more members of the same family, or, in 
the case of a contagious disease, in individuals with whom 
the subject has come into potentially contagious contact at 
a time when the earlier victim was likely to be contagious, 

20 (2) a prior occurrence of the disease in the subject, (3) 

prior occurrence of a related disease, or a condition known 
to increase the likelihood of the disease, in the subject; 
(4) appearance of a suspicious level of a marker of the 
disease, or a related disease or condition; (5) a subject 

25 who is immunologically compromised, e.g., by radiation 

treatment, HIV infection, drug use,, etc., or (6) membership 
in a particular group (e.g., a particular age, sex, race, 
ethnic group, etc.) which has been epidemiologically 
associated with that disease. 

30 A prophylaxis or treatment may be curative, that is, 

directed at the underlying cause of a disease, or 
ameliorative, that is, directed at the symptoms of the 
disease, especially those which reduce the quality of life. 
It should also be understood that to be useful, the 

35 protection provided need not be absolute, provided that it 
is sufficient to carry clinical value. An agent which 
provides protection- to a lesser degree than do competitive 
agents may still be of value if the other agents are 
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ineffective for a particular individual, if it can be used 
in condDination with other agents to enhance the level of 
protection, or if it is safer %han competitive agents. It is 
desirable that there be .a statistically significant {p=0.05 

.5 * or less) improvement in the treated subject relative to an 

appropriate untreated control, and it is desirable that this 
improvement be at least 10%, more preferably at least 25%, 
still more preferably at least 50%,. even more preferably at 
least 100%, in some- indicia of the incidence or severity of 
.10 the disease or of at least one symptom of the disease. 

At least one of the drugs of the present invention may 
be administered, by any means that achieve their intended 
purpose,' to protect a subject against a disease or other 
• adverse condition. The form of administration may be 

15 systemic or topical. For example, administration of such a 
composition may be by various parenteral routes such as 
subcutaneous, intravenous , intradermal, intramuscular, 
intraperitoneal, intranasal, transdermal, or buccal routes. 
Alternatively, or concurrently, administration may be by the 

20. oral route. Parenteral administration can be by bolus 
injection or by gradual perfusion over time. 

A typical regimen comprises administration of an 
effective amount of the drug, administered over a period 
ranging from a single dose, to dosing over a period of 

25 hours, days, weeks, months, or years. 

It is understood that the suitable dosage of a drug of 
the present invention will be dependent upon the age, sex, 
health, and weight of the recipient, kind of concurrent 
treatment, if any, frequency of treatment ^ and the nature of 

30 the effect desired. However, the most preferred dosage can 
be tailored to the individual subject, as is understood and 
. determinable by one of skill in the art, without undue 
experimentation. This will typically involve adjustment of 
a standard dose, e.g., reduction of the dose if the patient 

35 has a low body weight. 

Prior to use in humans, a drug will first be evaluated 
for safety and efficacy in laboratory animals. In human 
clinical studies, one would begin with a dose expected to be 
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safe in humans, based on the preclinical data for the drug 
in question, and on customary doses for analogous drugs (if 
any) . If this dose is effective, the dosage may be 
decreased, to determine the minimxam effective dose, if 
5 desired. If this dose is ineffective, it will be cautiously 
increased, with the patients monitored for signs of side 
effects. See, e.g., Berkow et al, eds.. The Merck Manual, 
15th- edition, Merck and Co., Rahway, N.J., 1987; Goodman et 
^1., eds., Goodman and Gilman's The Pharmacologioal Basis of 

10 Therapeutics, 8th edition, Pergamon Press, Inc., Elms.ford, 
N.Y., (1990); Avery's Drug Treatment: Principles and 
Practice of Clinical Pharmacology and Therapeutics/ 3rd 
edition, ADIS Press, LTD., Williams and Wilkins, Baltimore, 
MD. (1987), Ebadi, Pharmacology, Little, Brown and Co., 

15 Boston, (1985) , which references and references cited 

therein, are entirely incorporated herein by reference. " 

The total dose required for each treatment may be 
administered by multiple doses or in a single dose. The 
protein may be administered alone or in conjunction with 

20 other therapeutics directed to the disease or directed to 
other symptoms thereof. 

The appropriate dosage form will depend on the disease, 
. . the pharmaceutical, and the mode of administration; 

possibilities include tablets, capsules, lozenges, dental 

25 pastes, suppositories, inhalants, solutions, ointments and 
parenteral depots. See, e.g., Berker, supra, Goodman, 
supra^ Avery, supra and Ebadi, supra, which- are entirely 
incorporated herein by reference, including all references 
cited therein. 

30 In the case of peptide drugs, the drug may be 

administered in the form of an expression vector comprising 
a nucleic acid encoding the peptide; such a vector, after in 
corporation into the genetic complement of a cell of the 
patient, directs synthesis of the peptide. Suitable vectors 

35 include genetically engineered poxviruses (vaccinia) , 

adenoviruses, adeno-associated viruses, herpesviruses and 
lenti viruses which are or have been rendered nonpathogenic. 

In addition to at least one drug as described herein, a 
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pharmaceutical composition may contain suitable 
pharmaceutically acceptable carriers, such as excipientS/ 
carriers and/or auxiliaries which facilitate processing of 
the active compounds into preparations which can be used 
5 pharmaceutically. See, e.g., Berker/ supra, Goodman, supra, 
Avery, supra and Ebadi, supra, which are entirely 
incorporated herein by reference, included all references 
cited therein. 

10 ' Assay Compositions axidi Methods 

Target Organism 

The invention contemplates that it may be appropriate 
to ascertain or to mediate the biological activity of a 
substance of this invention in a target organism. 
15 The target organism may be a plant, animal, or 

microorganism. 

In the case of a plant, it may be an economic plant, in 
which case the drug may be intended to increase the disease, 
weather or pest resistance, alter the growth 

20 characteristics, or otherwise improve the useful 

characteristics or mute undesirable characteristics of the 
plant. Or it may be a weed, in which case the drug may be 
intended to kill or otherwise inhibit the growth of the 
plant, or to alter its characteristics to convert it from a 

25 weed to an economic plant. The plant may be a tree, shrub, 
crop, grass, etc. The plant may be an algae (which are in 
some cases also microorganisms) , or a vascular plant, 
• especially gymnosperms (particularly conifers) and 

angiosperms. Angiosperms may be monocots or dicots. .The 

30 plants of greatest interest are rice, wheat, corn, alfalfa, 
soybeans, potatoes, peanuts, tomatoes, melons, apples, 
pears, plums, pineapples, fir, spruce, pine, cedar, and oak. 

If the target organism is a microorganism, it may be 
algae, bacteria, fungi, or a virus (although the biological 

35 activity of a virus must be determined in a virus-infected 
cell) . The microorganism may be human or other animal or 
plant pathogen, or it may be nonpathogenic. It may be a 
soil or water organism, or one which normally lives inside 
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other living things. 

If the target organism is an animal, it may be a 
vertebrate or a nonvertebrate animal. Nonvertebrate animals 
are chiefly of interest when they act as pathogens or 
5 parasites, and the drugs are intended to act as biocidic or 
biostatic agents. Nonvertebrate animals of interest include 
worms, molluskS/ and arthropods. 

The target organism may also be a vertebrate animal, 
i.e., a mammal, bird, reptile, fish or amphibian. Among 
10 mammals, the target animal preferably belongs to the order 
Primata (humans, apes and monkeys) , Artiodactyla (e.g., 
•cows, pigs, sheep, goats, horses), Rodenta (e.g./ mice, 
rats) Lagomorpha (e.g., rabbits., hares), or Carnivora- (e.g., 
cats, dogs) . Among birds, the target animals are preferably 
15 of the orders Anseriformes (e.g., ducks, geese, swans) or 
Galliformes (e.g., quails,- grouse, pheasants, turkeys and 
chickens). Among fish, the target animal is preferably ' of 
the order Clupeiformes (e.g., sardines, shad, anchovies, 
whitefish, salmon) . 

20 

Target Tissues 

. The term "target tissue" refers to any whole animal, 
physiological system, whole organ, part of organ, 
miscellaneous tissue, cell, or cell component (e.g.,. the 

25 cell membrane) of a target animal in which biological 
activity may be measured. 

Routinely in mammals one would choose to compare and 
contrast the biological impact on virtually any and all 
tissues which express the subject receptor protein. The 

30 main tissues to use are: brain, heart, lung, kidney, liver, 
pancreas, skin, intestines, adipose, stomach, skeletal 
muscle, adrenal glands, breast, prostate, vasculature, 
retina, cornea, thyroid gland, parathyroid glands, thymus, 
bone marrow, bone, etc. 

35 Another classification would be by cell type: B cells, 

T cells, macrophages, neutrophils, eosinophils, mast cells, 
platelets, megakaryocytes, erythrocytes, bone marrow stomal 
cells, fibroblasts, neurons, astrocytes, neuroglia. 



wo 2004/013289 



PCT/US2003/024053 



65 

microglia^ epithelial cells (from any organ, e.g. skin/ 
breast, prostate, lung, intestines etc) , cardiac xouscle 
cells, smooth muscle cells, striated muscle cells, 
osteoblasts, osteocytes, chondroblasts, chondrocytes, 
keratinocytes/ melanocytes, etc. 

Of course, in the case of a unicellular organism, there 
is no distinction between the "target organism" and the 
"target tissue". 

Screening Assays 

Assays intended to deternjiine the binding or the 
biological activity of a substance are called preliminary 
screening assays. 

. Screening assays will typically be either in vitro 
, (cell-free) assays (for binding to an immobilized receptor) 
or cell-based assays (for alterations in the phenotype of - 
the cell) . They will not involve screening of whole 
multicellular organisms, or isolated * organs . The comments 
on diagnostic biological assays apply mutatis mutandis to 
screening cell-based assays. 

In Vitro vs. In Vivo Assays 

The term in vivo is descriptive of an event, such as 
binding or enzymatic action, which occurs within a living 
organism. The organism in question may, however, be 
genetically modified. The term in vitro refers to an .event 
which occurs outside a living organism. Parts of an 
organism (e.g., a membrane, or an isolated biochemical) are 
used, together with artificial substrates and/or conditions. 
For the purpose of the present invention, the term in vitro 
excludes events occurring inside or on an intact cell, 
whether of a unicellular or multicellular organism. 

In vivo assays include both cell-based assays, and 
organismic assays. The cell-based assays include both assays 
on unicellular organisms, and assays on isolated cells or 
cell cultures derived from multicellular organisms. The 
cell cultures may be mixed, provided that they are not 
organized into tissues or organs. The term organismic assay 
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refers to assays on whole multicellular organisms/ and 
assays on isolated organs, or tissues of such organisms. 

In vitro Diagnostic Methods and Reagents 

5 . • 

The in vitro assay-s of the present invention may be 
applied to any suitable analyte-containing sample, and may 
be qualitative or quantitative in nature.. 

10 Sample 

The .sample will normally be a biological fluid, such as 
blood, urine, lymph, semen, milk, or cerebrospinal fluid, or 
a fraction or derivative thereof, or a biological tissue, in 
the. form of, e.g., a tissue section or homogenate. However, 

15 the sample conceivably could be (or derived from) a food or 
• beverage, a pharmaceutical or diagnostic composition, soil, 
or surface or ground water. .If a biological fluid or 
tissue, it may be taken from a human or other mammal, 
vertebrate or animal, or from a plant. The preferred sample 

20 is blood, or a fraction or derivative thereof. 

Binding and Reaction Assays 

The assay may be a binding assay, in which one step 
involves the binding of a diagnostic reagent to the analyte, 

25 or a reaction assay, which involves the reaction of a 

reagent with the analyte. The reagents used in a binding 
assay may be classified as to the nature of their 
interaction with analyte: (1) analyte analogues, or (2) 
analyte binding molecules (ABM) . They may be labeled or 

30 insolubilized. 

In a reaction assay, the assay may look for a direct 
reaction between the analyte and a reagent which is reactive 
with the analyte, or if the analyte is an enzyme or enzyme 
inhibitor, for a reaction catalyzed or inhibited by the 

35 analyte. The reagent may be a reactant, a catalyst, or an 
inhibitor for the reaction. 

An assay may involve a cascade of steps in which the 
product of one step acts as the target for the next step. 
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These steps may be binding steps, reaction steps, or a 
combination thereof. 

Signal Producing System (SPS) 

5 

In order to detect the presence, or measure the amount, 
of an analyte, the assay must provide for a signal producing 
system (SPS) in which there. is a detectable difference in 
the signal produced, depending on whether the analyte is 

10 present or absent (or, in a quantitative assay, on the 

amount of the analyte) . The detectable signal may be one 
which is visually detectable, or one detectable only with 
instruments. Possible signals • include production of colored 
or luminescent products, alteration of the characteristics 

15 (including amplitude or polarization) of absorption or 

emission of radiation* by an assay component or product, and 
precipitation or agglutination of a component or product. 
The term "signal" is intended to include the discontinuance 
of an existing signal, or a change in the rate of change of 

20 an observable parameter, rather than a change in its 

absolute value * The signal may be monitored manually or 
automatically. 

In a reaction assay, the signal is often a product of 
the reaction. In a binding assay, it is normally provided 

25 by a label borne by a labeled reagent. 

Labels 

The component of the signal producing system which is 
most intimately associated with the diagnostic reagent. is 

30 called the "label". A label may be, e.g., a radioisotope, a 
fluorophore, an enzyme, a co-enzyme, an enzyme substrate, an 
electron-dense compound, an agglutinable particle. 

The radioactive isotope can be detected by such means 
as the use of a gamma counter or a scintillation counter or 

35 by autoradiography. Isotopes which are particularly useful 
for the purpose of the present invention include ^H, ^^^I, 
^3^1, 3^3, ^^C, and ^^P. "^I is preferred for antibody 
labeling. 
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The label may also be a flubrophore. When the 
f luorescently labeled reagent is exposed to light of the 
proper wave length, its presence can then be detected due to 
fluorescence. Among the most commonly used fluorescent 
5 labelling compounds are fluorescein isothiocyanate/ 

rhodamine, phycoerythrin, phycocyanin, allophycocyanin,. o- 
phthaldehyde and f luorescamine. 

Alternatively, fluorescence-emitting metals such as 
"*Eu, or others of the lanthanide series, may be incorporate 

10 into a diagnostic reagent using such metal chelating groups 
•as diethylenetriaminepentaacetic acid (DTPA) of 
ethylenediamine-tetraacetic acid. (EDTA) . 

The label may also, be a chemi luminescent compound.. The 
presence of the chemiluminescently labeled reagent is then 

15 determined by detecting the presence of luminescence that 
arises during the course of a chemical reaction. Examples 
of particularly useful chemi luminescent labeling compounds 
are luminol, isolumino, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

20 Likewise, a bioluminescent compound may be used for 

labeling, Bioluitiinescence is a type of chemiluminescence 
found in biological systems in which a catalytic protein 
increases the efficiency of the chemi luminescent reaction. 
The presence of a bioluminescent protein is determined by 

25 detecting the presence of luminescence. Important 

bioluminescent compounds for purposes of labeling are 
luciferin, lucif erase and aeqiiorin. 

Enzyme labels, such as horseradish peroxidase and 
alkaline phosphatase, are preferred. When an enzyme label 

30 is used, the signal producing system must also include a 

substrate for the enzyme. If the enzymatic reaction product 
is not itself detectable, the SPS will include one or more 
additional reactants so that a detectable product appears. 

An enzyme analyte may act as its own label if an enzyme 

35 inhibitor is used as a diagnostic reagent. 

Binding- Assay Formats 

Binding assays may be divided into two basic types. 
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heterogeneous and homogeneous. In heterogeneous assays , the 
interaction between the affinity molecule and the analyte 
does not affect the label, hence, to determine the amount or 
presence of analyte, bound label must be separated from free 
5 label; In homogeneous assays, the interaction does affect, 
the activity of the label,, and therefore analyte levels can 
be deduced without the need for a separation step. 

In one embodiment, the ABM is insolubilized by coupling 
it to. a macromolecular. support, and analyte in the sample is 

10 allowed to compete with a known quantity of a labeled or 
specifically labelable analyte analogue. The "analyte . 
analogue" is a molecule capable of competing with analyte 
•for binding to the ABM, and the term is intended to include 
analyte itself. It may be labeled already, or it may be 

15 labeled subsequently by specifically binding the label to a 
moiety differentiating the analyte analogue from analyte. 
The solid and liquid phases are separated, and the labeled 
analyte analogue in one phase is quantified. The higher the 
level of analyte analogue in the solid phase, i.e., 

20 sticking to the ABM, the lower the level of analyte in the 
' sample. 

In a "sandwich assay", both an insolubilized ABM, and a 
labeled ABM are employed. The analyte is captured by the 
insolubilized ABM and is tagged by the labeled ABM, forming 
25 a ternary complex. The reagents may be added to the sample 
in either order, or simultaneously. The ABMs may be the 
same or different. The amount of labeled ABM in the ternary 
•complex is directly proportional to the amount of analyte in 
the sample. 

30 The two embodiments described above are both 

heterogeneous assays. However, homogeneous assays are 
conceivable. The key is that the label be affected by 
whether or not the complex is formed. 
Conjugation Methods 

35 A label may be conjugated, directly or indirectly 

(e.g., through a labeled anti-ABM antibody), covalently 
(e.g., with SPDP) or noncovalently, to the ABM, to produce a 
diagnostic reagent. Similarly, the ABM may be conjugated to 
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a solid phase support to form a solid phase ("capture") 

diagnostic reagent. 

Suitable supports include glass, polystyrene, 

polypropylene, polyethylene, dextran, nylon, amylases, 
5 natural and modified celluloses, polyacrylamides, agaroses, 

and magnetite. The nature of the carrier can be either. 

soluble to some extent or insoluble for the purposes of the 

present invention. 

The support material may have virtually any possible 
10 . structural configuration so long as the coupled molecule is 

capable of binding to its target. Thus the support 

configuration may be spherical, as in a bead, or 

cylindrical, as in the inside surface of a test tube, or the 

external surface of a rod. Alternatively, the surface may 
15 be flat such as a sheet, test strip, etc. 

Biological Assays 

A biological assay measures or detects a biological 
response of a biological entity to a substance. 

20 The biological entity may be a whole organism, an 

isolated organ or tissue, freshly isolated cells, an 
' immortalized cell line, or a subcellular component (such as 
a membrane; this term should not be construed as including 
an isolated receptor) . The entity may be, or may be derived 

25 from, an organism which occurs in nature, or which is 
modified in some way. Modifications may be genetic 
(including radiation and chemical mutants, and genetic 
engineering) or somatic . (e.g. , surgical, chemical, etc.). 
In the case of a multicellular entity, the modifications may 

30 affect some or all cells. The entity need not be the target 
organism, or a derivative thereof, if there is a reasonable 
correlation between bioassay activity in the assay entity 
and biological activity in the target,'organism. 

The entity is placed in a particular environment, which 

35 • may be more or less natural- For example, a culture medium 
may, but need not, contain serum or serum substitutes, and 
it may, but need not, include a support matrix of some kind, 
it may be still, or agitated. It may contain particular 
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biological or chemical agents^ or have particular physical 
parameters (e.g., temperature), that are intended to nourish 
or challenge the biological entity. 

There must also be a detectable biological marker for 
5 : the response. " At the cellular level, the most common 

. markers are cell survival and proliferation, cell behavior 
(clustering, motility), cell morphology (shape, color), and 
biochemical activity (overall DNA synthesis, overall protein 
synthesis, and specific metabolic activities, such, as 

10*. utilization of particular nutrients, e.g., consumption of 
oxygen, production of COj, production of organic acids, 
uptake or discharge of .ions) 1 

The direct signal produced by the biological marker may 
be transformed by a signal producing system into a different- 

15 signal which is more observable, for example, a fluorescent 
or colorimetric signal. 

The entity, environment, marker and signal producing 
. system are chosen to achieve a clinically acceptable level 
of sensitivity, specificity and accuracy. 

20 In some cases, the goal will be to identify substances 

which mediate the biological activity of a natural 
biological entity, and the assay is carried out directly 
with that entity. In other cases, the biological entity is 
used simply as a model of some more complex (or otherwise 

25 inconvenient to work with) biological entity. In that 

event, the model biological entity is used because activity 
in the model system is considered more predictive of 
activity in the 'ultimate natural biological entity than is 
simple binding activity in an in vitro system. The model 

30 entity is used instead of the ultimate entity because the 

former is more expensive or slower to work with, or because 
ethical considerations forbid working with the ultimate 
entity yet. 

The mo.del entity may be naturally occurring, if the 
35 model entity usefully models the ultimate entity under some 
conditions. Or it may be non-naturally occurring, with 
modifications that increase its resemblance to the ultimate 
entity. 
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Transgenic animals , such as transgenic mice/ rats, and 
rabbits, have been found useful as model systems. 

In cell-based model assays', where the biological 
activity is mediated by binding to a receptor (target 
5 protein) , the receptor may be functionally connected to a 
signal (biological marker) producing system, which may be 
endogenous or exogenous to the cell. 
There are a number of techniques of doing this. 

10 " Zero-Hybrid" Systems 

In these systems, the binding of a peptide to the 
target protein results in a screenable or selectable 
phenotypic change, without resort -to fusing the target 
• •.protein (or a ligand binding moiety thereof) to an 
15 endogenous protein. It may be that the target protein is 

endogenous to the host cell, or is substantiaLly identical.- 
to an endogenous receptor so that it can take advantage of * 
the latter 's native signal transduction pathway. Or 
sufficient elements of the signal transduction pathway 
20 normally associated with the target protein may be 

engineered into the cell so that the cell signals binding to 
the target protein. 

"One-Hybrid" Systems 

In these systems, a chimera receptor, a hybrid of the 
target protein and an endogenous receptor, is used. The 
chimeric receptor has the ligand binding characteristics of 
the target protein and the signal transduction 
characteristics of the endogenous receptor. Thus, the 
normal signal transduction pathway of the endogenous 
receptor is subverted. 

Preferably, the endogenous receptor is inactivated, or 
the conditions of the assay avoid activation of the 
endogenous receptor, to improve the signal-to-noise ratio. 
See Fowlkes USP 5,789,184 for a yeast system. 
Another type of "one-hybrid" system combines a peptide: 
DNA-binding domain fusion with an unfused target receptor 
that possesses an activation domain. 
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"Two-Hybrid" System 

In a preferred embodiment, the cell-based assay is a 
two hybrid system. This term implies that the ligand is 
incorporated into a first hybrid protein, and the receptor 
5 into a second hybrid protein. The first hybrid also 

comprises component A of a signal generating system, apd the 
second hybrid comprises component B of that system. 
Components A and B, by themselves, are insufficient to 
generate a signal. However, if the ligand binds the 
10 receptor, components A and B are brought into sufficiently - 
close proximity so that they can cooperate to generate a 
signal. 

Components A and B may naturally occur, or be 
substantially identical to moieties which naturally occur, 
15 as components of a single naturally occurring biomolecule, 
or they may naturally occur, or be substantially identical 
to moieties which naturally occur, as separate naturally 
occurring biomolecules which interact in nature. 

20 Two-Hybrid System: Transcription Factor Type 

In a preferred "two-hybrid" embodiment, one member- of a 
peptide ligand: receptor binding pair is expressed as a 
fusion to a DNA-binding domain (DBD) from a transcription 
factor (this fusion protein is called the "bait"), and the 

25 other is expressed as a fusion to a transactivation domain 

.(TAD) (this fusion- protein is called the "fish", the "prey", 
or the "catch") . The transactivation domain should be 
complementary to the DNArbinding domain, i.e., it should 
interact with the latter so as to activate transcription of 

30 a specially designed reporter gene that carries a binding 

site for the DNA-binding domain. Naturally, the two fusion 
proteins must likewise be complementary. 

This complementarity may be achieved by use of the 
complementary and separable DNA-binding and transcriptional 

35 activator domains of a single transcriptional activator 

protein, or one may use complementary domains derived from 
different proteins. The domains may be identical to the 
native domains, or mutants thereof. The assay members may 
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be fused directly to the DBD or TAD, or fused through an 
intermediated linker. 

The target DNA operator may. be the native operator 
sequence, or a mutant operator. Mutations in the operator 
5 may be coordinated with mutations. in the DBD and the TAD, 

An example of a suitable transcription activation system is 
one comprising the DNA-binding domain from the bacterial 
repressor LexA and the activation domain from the yeast 
transcription factor Gal4, with the reporter gene operably 
10 linked to the LexA operator. 

It is not necessary to employ the intact target 
receptor; just the ligand-binding moiety is sufficient. 

The two fusion proteins may be expressed from the same 
or different vectors. Likewise, the activatable reporter 
15 .gene may be expressed from the same vector as either fusion 
protein (or both proteins), or from a third vector. 

Potential DNA-binding domains include Gal4, LexA, and 
mutant domains substantially identical to the above. 

Potential activation domains include E. coli B42, Gal4 
20 activation domain II, and HSV VP16, and mutant domains 
substantially identical to the above. 

Potential operators include the native Operators for 
the desired activation domain, and mutant domains 
substantially identical to the native operator. 
25 The fusion proteins may comprise nuclear localization 

signals. 

The assay system will include a signal producing 
system, too. The first element of this system is a reporter 
gene operably linked to an operator responsive to the DBD 

JO and TAD of choice. The expression of this reporter gene 
will result, directly or indirectly, in a selectable or 
screenable phenotype (the signal) . The signal producing 
system may include, besides the reporter gene, additional 
genetic or biochemical elements which cooperate in the 

5 production of the signal. Such an element could be, for 

example, a selective agent in the cell growth medium. There 
may be more than one signal producing system, and the system 
may include more than one reporter gene. 
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The sensitivity of the system may be adjusted by, e.g., 
use of competitive inhibitors of any step in the activation 
or signal production process, increasing or decreasing the 
number of operators, using a stronger or weaker DBD or TAD, 
5 etc. 

When the signal is the death or survival of the cell in 
question, or proliferation or nonproliferation of the cell 
in question, the assay is said to be* a selection. When the 

• .signal merely results in .a detectable phenotype by which the 
10 signalling cell may be differentiated from the same cell in 

• a nonsignalling state (either way being a living cell) , the 
assay is a screen. However, the term "screening assay" may 
be used in a broader sense to include a selection. When the . 
narrower sense is intended, we will use the term 

15 "nonselective screen". 

Various screening and selection systems ar^e discussed 
in Ladner, USP 5,198,346. 

Screening and selection may be for or against the 
■ peptide: target protein or compound : target protein 
20 . interaction. 

Preferred assay cells are microbial (bacterial, yeast, 
algal, protozooal) , invertebrate, vertebrate (esp. 
mammalian, particularly human) . The best developed two- 
hybrid assays are yeast and mammalian systems. 
25 Normally, two hybrid assays are used to determine 

whether a protein X and a protein Y interact, by virtue of 
their ability to reconstitute the interaction of the DBD and 
the TAD. However, augmented two-hybrid assays have been 
used to detect interactions that depend on a third, non- 
30 protein ligand. 

For more guidance on two-hybrid assays, see Brent and 
Finley, Jr., Ann. Rev. Genet., 31:663-704 (1997); Fremont- 
Racine, et al.. Nature Genetics, 277-281 (16 July 1997); 
Allen, et al . , TIBS, 511-16 (Dec. 1995); LeCrenier, et al., 
35 BioEssays, 20:1-6 (1998); Xu, et al . , Proc. Nat. Acad. sci. 
(USA), 94:12473-8 (Nov. 1992); Esotak, et al., Mol - Cell; 
Biol., 15:5820-9 (1995); Yang, et al.. Nucleic Acids Res., 
23:1152-6 (1995); Bendixen, et al.. Nucleic Acids Res., 
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22:1778-9 (1994); Fuller, et al., BioTechniques, 25:85-92 
(July 1998); Cohen, et al . , PNAS (USA) 95:14272-7 (1998); 
Kolonin and Finley, Jr., PNAS (USA) 95:14266-71 (1998). See 
also Vasavada, et al . , PNAS (USA) 88 : 10686-90 (1991) 
(contingent replication assay), and Rehrauer, et al., J. 
Biol. Chem., 271:23865-73 91996) (LexA repressor cleavage 
assay) . 

Two-Hybrid Systems: reporter Enzyme type 

In another embodiment, the components A* and B 
.reconstitute an enzyme which is not a transcription factor. 

As in the last example^ the effect of the 
reconstitution of the enzyme is a phenotypic change which 
may be a screenable change, a selectable change, or both. 

In vivo Diagnostic Uses 

Radio-labelled ABM may be administered to the human or 
animal subject. Administration is typically by injection, 
e.g., intravenous or arterial or other means of 
administration in a quantity sufficient to permit subsequent . 
dynamic and/or static imaging using suitable radio-detecting 
devices. The dosage is the smallest amount capable of 
providing a diagnostically effective image, and may be 
determined by means conventional in' the art, using known 
radio-imaging agents as a guide. 

Typically, the imaging is carried out on the whole body 
of the subject, or on that portion of the body or organ 
relevant to the condition or disease under study. The 
amount of radio-labelled ABM accumulated at a given point in 
time in relevant target organs can then be quantified. 

A particularly suitable radio-detecting device is a 
scintillation camera, such as a gamma camera. A 
scintillation camera is a stationary device that can be used 
to image distribution of radio-labelled ABM. The detection 
device in the camera senses the radioactive decay, the 
distribution of which can be recorded. Data produced by the 
imaging system can be digitized. The digitized information 
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can be analyzed over time discontinuously or continuously. 
The digitized data can be processed to produce images, 
called frames, of the pattern of uptake of the radio- 
labelled ABM in the target organ at a discrete point in 
5 time. In most continuous (dynamic) studies, quantitative 
data is obtained by observing changes in distributions., of 
radioactive decay in target organs over time. In other 
words, a time-activity analysis of the data will illustrate 
uptake through clearance of the radio-labelled binding 

10 protein by the target organs with time. 

Various factors should be taken into consideration, in 
selecting an appropriate radioisotope. The radioisotope 
must be selected wi-th a view to • obtaining good quality 
resolution upon imaging, should be safe for diagnostic use 

15 in humans and animals, and should preferably have a. short 

physical half-iif e-..so as to decrease the amount of radiation 
received by the body. The radioisotope used should 
preferably be pharmacologically inert, and, in the 
quantities administered, should not have any substantial 

20 physiological effect. 

The ABM may be radio-labelled with different isotopes 
of iodine, for example "^I, "^I, or "^I- (see for example, 
U.S. Patent 4,609,725). The extent of radio-labeling must, 
however be monitored, since it will affect the calculations 

25 made based on the imaging results (i.e. a diiodinated ABM 
will result in twice the radiation count of a similar 
monoiodinated ABM over the same time frame) . 

In applications to hximan subjects, it may be desirable 
to use radioisotopes other than "^I for labelling in order t 

30 decrease the total dosimetry exposure of the human body and 
to optimize the detectability of the labelled molecule 
(though this radioisotope can be used if circumstances 
require) . Ready availability for clinical use is also a 
factor. Accordingly, for human applications, preferred 

35 radio-labels are for example, ^^c, ^"'Ga, ^^Ga, ^^Y, ^^^In, 
uamjn, J ^ "6j^e, ^^^Re or ^iiAt. 

The radio-labelled ABM may be prepared by various 
methods. These include radio-halogenation by the chloramine 
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- T method or the lactoperoxidase method and subsequent 
purification by HPLC (high pressure liquid chromatography) , 
for example as described by J. Gutkowska et al in 
"Endocrinology and Metabolism Clinics of America: (1987) ii 
(1):183. Other known methods of radio-labelling can be 
used, such as lODOBEADS"*. • 

There are a number .of different methods of delivering 
the. radio-labelled ABM to the end-user. It may be 
administered by any means that enables the active agent to . 
reach the agent's .site of action in the body of a mammal. 
Because proteins are subject to beiTig digested when 
administered orally, parenteral administration, i.e., 
intravenous, subcutaneous, intramuscular, would ordinarily 
be. used to optimize absorption of an ABM, such as an 
antibody, which is a protein. 
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EXAMPLES 
Example 1 

Differential expression of Genes in Kidnevs of 
Streptozotocin-Treated Mice as a Result of GHR/BP knockout 

To identify genes potentially involved in the 
protection of kidneys during diabetes, we induced diabetes 
by STZ treatment of GHR/BP (knockout) mice (prepared as 

described by Zhou, et al., 1997, supra ) and nontransgenic 
(+/+) mice- These mice had a- 129 Ola/BalbC genetic 
background. A PCR-based strategy was used to generate cDNA 
subtraction libraries from RNA isolated from the kidneys of 
diabetic and +/+ mice after ten weeks of hyperglycemia • 

• To identify genes whose increased expression potentially 
results in protection against glomerulosclerosis, the cDN^s 
expressed in +/+ kidneys were .subtracted from the cDNAs 
expressed in kidneys, resulting in a library (-/-) • 

consisting of cDNAs upregulated in the protected kidney. 

To identify genes whose decreased expression potentially 
results in protection against glomerulosclerosis, the cDNAs 
expressed in kidneys were subtracted from the cDNAs 

expressed in +/+ kidneys, resulting in a library (+/+) 
consisting of cDNAs down-regulated in the protected 
kidney. cDNA clones were spotted on duplicate membranes and 
screened for differential expression using the -/- and +/+ 
subtraction libraries as probes. Differential expression of 
clones displaying differential hybridization to the and 
+/+ library probes was confirmed by northern analysis of RNA 
from diabetic and +/+ kidneys using the clones as 

probes. Nucleotide sequences were determined for each clone 
and compared by BLAST analysis to the sequences in the 
public domain databases. 

We have obtained several partial cDNA clones whose 
altered expression is associated with the protection of the 
kidney from damage . 

Induction of diabetes with streptozotocin 

Two to three month old female mice were made diabetic 
(DB) with daily intraperitoneal injections of streptozotocin 
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(STZ; 80-85 ]ag/g body weight, dissolved in 0.1 M citrate 
buffer at a concentration of 8-8.5 pg/itil) until blood 
glucose levels exceeded 200 mg/dL or for a maximum of 8 
injections. Nondiabetic (ND) controls received daily 
5 injections of citrate buffer. After 4 or 10 weeks of 

diabetes (defined as maintenance of a blood glucose leyel of 
at least 300 mg/dL) , the mice were sacrificed and the right 
kidney quickly removed, weighed, and frozen in liquid 
nitrogen or immediately pi^ocesaed for extraction of RNA. . 
10 The left kidney was perfused through the dorsal aorta with 
phosphate buffered saline followed by 4% paraformaldehyde 
(PFA) in 0-15 Naci, then removed^ cut in half longitudinally 
and stored in 4% PFA for histological analysis. 

15 RNA isolation 

Total RN|V was isolated from kidneys of diabetic female 
GHR/BP +/+ and mice after 4 or 10 weeks of diabetes 

induced by streptozotocin using the RNA STAT-60 Total 
RNA/mRNA Isolation Reagent according to the manufacturer's 

20 instructions (Tel-Test, Friendswood, TX) . 

cDNA synthesis 

Prior to cDNA synthesis, a portion (50 lag) of RNA was 
further purified to remove small RNAs using the RNeasy Mini 

25 protocol for RNA clean up as instructed by the manufacturer 
(Qiagen Inc., Santa Clarita, CA) . cDNA then was synthesized 
using 1 ]jg of this purified sample of total RNA from one 
diabetic GHR/BP +/+ and one diabetic GHR/BP -/- mouse using, 
the SMART PGR cDNA Synthesis Kit according to the 

30 manufacturer's instructions (CLONTECH, Palo Alto, CA) . 

Generation of cDNA Subtraction libraries 

Forward- and reverse-subtracted cDNA libraries were 
generated using the PCR-Select cDNA Subtraction Kit 
35 (CLONTECH, Palo Alto, CA) and the +/+ and -/- samples. One 
library (+/+ or G) included clones down-regulated in the 
diabetic -/- mice and the other library (-/- or H) included 
clones up-regulated in diabetic -/- mice. 
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Isolation of Individual clones 

After generating the cDNA subtraction libraries, the 
PGR product ends were made blunt by treatment with Pfu DNA ' 
polymerase (Stratagene, La Jolla, CA) and subcloned into a 
5 bacterial plasmid vector using the Zero Blunt TOPO PGR 

Cloning Kit as instructed by the manufacturer (Invitrogen 
Corp., Garlsbady GA) . Individual clones were obtained by 
. plating on selective media. 

10. Screening by differential hybridization 

• cDNA arrays of clones from the forward and reverse 
subtracted libraries were screened with probes made from 
each library using the PCR-Select Differential Screening Kit 
according to the manufacturer's instructions (CLONTECH; Palo 

15 Alto, CA) . 

Nucleotide sequence determination 

Plasmid DNA from bacterial colonies carrying the 
differentially expressed cDNA inserts was isolated using the 

20 QIAprep Spin Miniprep Kit according to the manufacturer' s 
instructions (Qiagen Inc., Santa Glarita, GA) . Nucleotide 
sequences were determined by use of the ABI PRISM BigDye 
Terminator Cycle Sequencing Ready Reaction Kit with 
electrophoresis on the ABI PRISM 377 DNA Sequencer (PE 

25 Applied Biosys terns, Foster City, GA) . 

Database Searches 

Nucleotide sequences and predicted amino acid sequences 
were compared to public domain databases using the Blast 2.0 

30 program (National Center for Biotechnology Information, 

National Institutes of Health) . Nucleotide sequences were 
displayed using ABI prism Edit View 1.0.1 (PE Applied 
Biosystems, Foster City, CA) . 

Nucleotide database searches were conducted with the 

35 then current version of BLASTN 2.0.12, see Altschul-, et al., 
^'Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs". Nucleic Acids Res., 25:3389-3402 
(1997) . Searches employed the default parameters, unless 
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otherwise stated. 

For blastN searches, the default was the blastN matrix 
(1,-3), with gap penalties of 5 for existence and 2 for 
extension. 

5 Protein database searches were conducted with the then- 

current version of BLAST X, see Altschul et al . (1997), 
supra . Searches employed the default parameters, unless 
otherwise stated. The scoring matrix was BLOSUM62, with gap 
costs of 11 for existence and 1 for extension. The standard 

10 low complexity filter was used. 

^^ref" indicates that NCBI's RefSeq is the source 
database. The identifier that follows is a RefSeq accession 
number/ not a GenBank accession number. ^^RefSeq sequences 
are. derived from GenBank and provide non-redundant curated 

15 data representing our current knowledge of known genes . Some 
records include additional sequence information that was 
never submitted to an archival database but is available in 
the literature. A small number of sequences are provided 
through collaboration; the underlying primary sequence data 

20 is available in GenBank, but may not be available in any one 
GenBank record. RefSeq sequences are not submitted primary 
sequences. RefSeq records axe owned by NCBI and therefore 
can be updated as needed to maintain current annotation or 
to incorporate additional sequence information.'' See also 

25 httP! //www, ncbi > nlm. nih . aov/LocusLink/ref seq. html 

It will be appreciated by those in the art that the 
exact results of a database search will change from- day to 
day, as new secjuences are added. Also, if you query with a 
longer version of the original sequence, the results will 

30. change. The results given here were obtained at one time 
and no guarantee is made that the exact same hits would be 
obtained in a search on the filing date. However, if an 
alignment between a particular query sequence and a 
particular database sequence is discussed, that alignment 

35 • should. not change (if the parameters and sequences remain 
unchanged) . 

Northern analysis 
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Positive clones, identified by the differential 
hybridization screen, were used as probes in Northern 
hybridization analyses to confirm their differential 
expression. Total RNA isolated from the kidneys of diabetic 

. 5 • female +/+ and mice was resolved by agarose gel 

electrophoresis through a 1% agarose, 1% formaldehyde 
denaturing gel, transferred to positively charged nylon 
membrane, hybridized in DIG Easy Hyb with the cDNA insert 
labeled- by asymmetric PGR with Digoxigenin, and analyzed 

10 using anti-.Digoxigenin-alkaline phosphatase conjugate and 

GDP-Star according to the manufacturer's instructions (Roche 
Diagnostics Corp^ Indianapolis, IN). 

CHARACTERIZATION OF CLOHES 
15 Clone G26 

Insert size: 1527 bp query sequence (with gap) (SEQ ID 
NOs:l and 2) 

Nucleotide database search 

Blast N: 

20 Clone G26 is apparently a partial cDNA of a gene 

encoding a protein similar to Mus mus cuius kidney 3 beta- 
hydroxy steroid dehydo.genase type 4 (Hsd3b4) . The highest 
scoring DNA/alignment (890 bits, E valufe 0.0) was of bases 
3-560 of clone G26 with bases 1602-1048 of Hsd3b4 . (Note 

25 that since the orientation of the. cDNA inserts in the 
cloning vector was not known, ^'plus'' was assigned 
arbitrarily for the purpose of the Blast alignment. So, 
since. the match is to the minus strand of a known DNA, we 
assume that the strand labeled ^^plus'' was actually the minus 

30 strand of G26.) The percentage identity was 96% (540/561), 
with 9 gaps . 

There were four equally high scoring matches : Genbank 
BC013449.1, ref NM_008294.1, dbj AB049424.1 and Genbank 
L16919.1. The first is an M. musculus cDNA ^'similar to 
35 hydroxysteroid dehydr6genase-4 delta <5>-3-beta'' . The . 

second is M. musculus hydroxysteroid dehydogenase-4, delta 
<5>-3-beta. The third is M. musculus clone: 123-1, derived 
from 4x chromosome of T(X;4) 37H translocation. The. fourth 
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is the M. musculus oDNA (pK411A) 3-beta-hydroxysteroid 
dehydogenase/delta-5-delta-4 isomerase (Hsd3b4) • The other 
"high scoring DNA alignments were with genes closely related 
to Hsd3b4. 

5 The highest human matches in the main database were 

ail45b4506lreflNM 000862,11 Homo sapiens hydroxy-delta-5-st . .. • '76- Te-ll 
Qil958l7901einblAL121995,12lHSDJ920G3 Human DNA sequence fro..- !£ 7e-ll 
aH257052lQblS45679>llS45679 3 beta-hydroxysteroid dehydrog... 2fi 7e-ll 
Qil287843leinb|X53321.1|HS3BHD541 H. sapiens inRNA for 3Beta-h. . . Ig 7e-ll . 

10 ' Qil2386Heinb)X55997.1lHS3B5H5E Human gene f or 3-beta-5-hydr . . 76 7e-ll 
Qill79467lgblM3818Q. IIHUMBHSD Human 3-beta-hydroxysteroid d. . . 16, le-ll 
aill77l9^|qblM63397.1lH0M3BHSPD3 Human 3-beta-hydroxysteroi. . . 2£ 7e^ll 
aill7719QlablM35493.1lHOM3BHSP Human placental 3-beta-hydro. . - 1£ 7e-ll 
Qi{184398lQbiM27137 , HhOMHSDB Human 3 beta- hydroxy- 5 -ene st... 7^ 7e--ll 

15 Qill84275lab lM28392.HHUMHODB3 Human 3-beta-hydroxysteroid. . . * T& '76-11 

However, in the human EST library, the top scores were 
Qill578840lablAA0714 60. 11AA0714 60 zm73f03.sl Stratagene neu*. . . fiffi. 0»0 
Qill689820tabt]A129031 ,11JA129037 zolljel2.sl Stratagene neu. . . Ml 0,0 
20 Qill578927iqblAA071539.HAA071539 zm7 4bO 6 . si * Stratagene neu... 7^1 0.0 

aill578624lablAA071072.1lAA071072 zm73fo3.rl Stratagene neu... .jgO ,0.0 

Thus, of the human sequences. Clone G26 is most similar 
to a human EST (gi : 1578840) , not to any of the human cDNAs 
25 • reported in GenBank (see included alignments) . This EST is 
conceivably the cognate human gene. However, there is a 
warning in the database to the effect that ^'there is 
evidence that suggests that the 384-well parent plate of 
this clone contains both human and mouse derived clones. 
30 Thus, the origin of this clone is uncertain.'^ The same 

• warhing appears for gi:1688820, gi:1578927, and gi:1578624. 

Protein database search 

Blast X: The best score in the main database was with 
35 ' Mus musculus -hydroxvsteroid dehydrogenase-4 , delta <5>-:3-^ 
beta, ref NP_032320.1 (score 187 bits, expect 2e-71, 95% 
identity) . 

In the human database, the best scores were for 
QiU504507lrefllQP 000853,11 {NM«000862) hydroxy-delta-5-ste. . . 128 9e-48 
40 ai!l79468lablAAA51831 . II (M38180) 3-beta-hydroxysteroid deh. . . 123.96-48 
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Qil4504509lreflMP 000189. ll (NM_000198) hydrQxy-delta-5-ste. . . i22 2e-46 

aill2Q43436ieinblCACl9801 , ll (AL359553) dJ871G17.4 (novel 3-... 122 9e-45 

alll7488989ireflXP 060821.11 (XM_060821) similar to dJ871Gl... 122 9e-45 

ai|174890Q3lreflXP 060827.11 (XH»060827) ^similar to 3 BETA-... IZZ 3e-44 

aill9979635lqbiAAM08704 > ll (AF252254) 3-beta-hydroxysteroid. . . 122 2e-39 

ait386780lqb|AAA36001.1l (M28392) 3-beta-hydroxysteroid deh. . . 'US. 2e-39 



Note that these proteins are functionally related. 

Northern analysis: Down-regUlated in protected (DB -/-) 
kidney 

ND +/+>DB4-/+«ND-/-»DB-/-. 

The .characteristics of Hsd3b4 are discussed in Clarke, 
et ai., Mol. Endocrinol., 7:1569-78 (1993). Its 
relationship to other types of beta-hydroxysteroid 
dehydrogenases is discussed in Payne, et al., J. Steroid 
Biochem. Mol. Biol., 53:111-8 (1995). 

Clone H8 

Insert size: 875 bp query sequence 875 bp (SEQ ID NO: 2) 
Nucleotide database search 

Blast N: The best mouse match was to gp bF539644.1 EST 
1197 bits e=0.0 97%. In the main database, the best match 
was to Genbank AC023194.8, H. Sapiens clone RP11-216N21, E 
value of 8e-40,' score of 163 bits. Query 107-280 was 
aligned to subject 139697-139870, with 151/174 (86%) 
identities. There were several other alignments, too. 

The score for the match to Homo sapiens ATPase, 
XM_043092.1 was 40 bits, with E=0.008. 

Looking at human ESTs, the top matches were 
qi|15755092!ablBI763514. 1IBI7 63514 603050239F1 NIHJ4GC_116 . . . 612 
aill3293170lablBG399722.1lBG399722 602441316F1 NIH_MGC_75 H. . . 414 
ail5339253labiAl791537lAI791537 oj41bll.y5 NCI_CGAP_Kid3 ... 408 
ai]13340125lQblBG433708. 1IBG433708 602497822F1 NIH__MCG__75 H. . . 363 
aill2158598lQblBF820247.1lBF820247 CM0-RT0018-181100-701-cl . . . 359 
Qil5054161lablAI733048.1lAI733048 oj41bll,x5 NCI_CGAP_Kid3 ... 333 
aill5751872lablBI7660294 . 1IBI760294 603045361F1 NIH_MGC_116. . . 301 . 



Blast X: The top scoring sequences are: 
gi|18570892|ref|XP_088368.1|Hypothetical protein XP_088368 379 e~104 

ail73Q4909lreflNP 038505-11 {NM_013477) ATPase, H+ transpor. . . 370 e-102 
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alil?643287lsplP51ft63lVA0D MOOSE Vacuolar ATP synthase subu. . • 370 e-102 

aill477 6195lref IXP_Q43091 . II ATPase. H+ transporting, lysos... 370 e-102 

aill5029719laT:>lAAHl 1Q75 . 1IAAH11075 (BC011075) Unknown (prot.., 367 e-101 

aill8543319lreflNP 57QQ80.1I (NM_130724) CG2934 gene produc... 35± 3e-97 

5 alll2585456lsplO25531lWA0D MT^SE Vacuolar ATP synthase subu... 352 4e-97 

aill6259229lablAAK85455 , 2IAC006611. 2 (AC006611) Hypothetica . . • 331 2e-92 

aill750576HreflNP 491515.11 (NM_059114) vaculolar ATPase [... 32fi 4e-87 

XP_088368.1 is a hypothetical protein; region 15.. 214 
10 has been identified, as a ^'ATP synthase (C/AC39) ' subunit . 
Sequence xP_088368 .1 * was replaced on May 7, 2002 with 
gi:2048 5329; . ^'similar to Ac39/physophilin (Homo sapiens)" 

XP_04309.1 is ATPase^ H+ transporting, lysosomal 
15 (vacuolar protein pump), H. sapiens. It is also known as 

^Jsimilar to vacuolar ATP synthase subunit D", Between query 
3-818 and, subject 9-279, there was 65% identity (179/272), 
with a score of 370 bits, E=e-102. 

20 . Best mouse match ref NP_038505.1 370 bits e-102 65% 

vacuolar adenosine tri-phosphatase subunit D. 



Northern analysis: Up-r©gulated in protected (DB-/-) 
25 kidney 

DB -/- > > 
ND > = 

ND +/+ = 
DB +/+ 

30 

Cj^one Hi 

Insert size: 966 bp, query sequence 966 bp (SEQ ID 

N0:4) 

Nucleotide database search 
35 Blast N: Matches with GenBank U96635 Mus musculus 

ubiquitin protein ligase Nedd-4 mRNA (1780 bits, E 0.0), 
when bases 1-966 of HI are aligned with bases 3775-4739 of 
the database sequence. The percentage identity is 98% 
(957/969), with 7 gaps. 



wo 2004/013289 




PCT/US2003/024053 



87 

There is an even higher (1804) scoring alignment with 
BC007184, M. musculus clone IMAGE : 3591422 . 

The highest scoring hximan sequence is ref XM_046129.1/ 
Homo sapiens neural precursor cell expressed^ 
5 developmentally down-regulated 4 (NEDD 4), with a score of 
105 bits and E=4e-20. 

There are identical scoring human ESTs, e.g., BG035335 
and BF357965. 

' 10 Protein database search 

Blast X: Nothing, significant.. This is expected; the 
coding sequence of U 96635 is at bases 612-2945, so the 
clone is in 3' OTR of U96635.' 

Northern analysis: Up-regulated in protected (DB -/-) 
-kidney 

DB-/-»DB+/+=ND+/+=ND-/- . 

As described by Kxamar, et al., Genoiuics, 40:435-446 
(1997), Nedd4 encodes a protein which is strongly expressed 
during early CNS development- It contains a calcium/phospho 
lipid binding domain, three protein-protein interaction 
domains (the WW domains) , and a ubiqui tin-protein ligase 
domains. Ubiquitin labels proteins desired for degradation 
by substrate-specific ubiquitin conjugating enzymes and 
ubiquitin-protein ligases. 

The alignment of mouseup human (Genbank D42055) and rat 
(Genbank U50842) DNA sequences is set forth in Kumar, Fig. 
2. 

Pirozzi, et al., J. Biol. Chem., 272:14611-6 (1987) has 
identified other human WW domain-containing proteins; these 
are aligned with, inter alia . Nedd-4 in Fig. 2. 

Clone Fl 

35 Insert size: 1159 bp, query sequence 1159 bp (SEQ ID 

N0:5) 

Nucleotide database search 

Blast N: Matches with ref NM 019973.1 Mus musculus Son 



15 



20 



25 



30 
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cell proliferation protein mRNA (1931 bits^ E«0.0, 99% 
identity between query 150-1159 and subject 413-1419)^ 
GenBank AFl 93607 Mus musculus truncated SON protein (Son) 
mRNA (1931 bits, E value=0,0/ 99% identity between query 
150-1159 and subject 413-1419) and with Genbank AF193597 Mus 
musculus SON protein mRNA (1931 bits, E value 0.0/ 99% 
identity between query 150-1159 and subject 156-1162) . 

The best human match was to ref NM_058183.1 Homo 
sapiens SON DNA binding protein mRNA, 922 bits, E=0.0, 86% 
identity of query 150-1159 to subject 414-1435. 

Protein database search 

Blast X: Matches with ref NP_064357 and AAF23121.1 
(AF193607) Mus musculus truncated SON protein (624 bits, 
E=e-178, 86% identity of query 3-1157 with subject 80-463) 
and GenBank AAF23120.1 (AF193606) -^Mus" musculus Son protein 
(624 bits, E=e-178, 86% identity over 385 a. a.). 

Please see also ref NP_003094.2, the H. sapiens SON 
DNA-binding protein, KIAA1019 (score 497 bits, E=e-140, 
align query 3-1157 to subject 80-468 for 278/390 (71%) 
identities, 5/390 gaps) . 

Northern analysis: up-regulated in protected (DB -/-) 
kidney DB-/-»ND-/-=ND+/+»DB+/+ . 

The SON protein is discussed in Wynn, et al., 
^'Organization and Conservation of the GART-S0N-3SG locus in 
mouse and human genomes'". Genomics, 68 (l):57-62 (2000); its 
gene is within the Down Syndrome Critical Region. 

Of the human sequences, clone Fl is most similar to the 
SON DNA binding protein (gi : 15055517 ) with 71% identity and 
e-140. 

Clone 638 

1050 bp insert, 842 bp query sequence (SEQ ID NO: 18) 

Highest alignment score was with Genbank AF178454.1, Mus 
musculus anti-human apolipoprotein A monoclonal antibody 20 
L Kappa light chain, 

Identities==538/565 (95%), score 888 bits, E=0.O. 
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Highest alignment score with a Human mRNA was with Genbank 
S65921.1, anti-colorectal carcinoma light chain glycoprotein 
CANAG-50 specific IgGl Kappa 

Identities=358/387 (92%), score 529, e-148 • 

5 

At the protein level, the highest score was with gi 
1310829/ pdb IKNO, Igg2a -p-nitrophenyl methyl phosphonate. 
The highest human score was with gb AAB28160.1 (S65921) an 
anti-'Colorectal carcinoma antibody, • score 258, E=3e.-71, See 
10 -Zemel, et al., Mplec. Immunol., 31: 127-37 * (1994) . 

Northern analysis:* up-regulated in protected (DB-/-) 
kidney • 



15 

In like manner, we have identified several additional 
clones, with the following Northern analyses (ND 
nondiabetic/no STZ; DB diabetic/STL; NT nontransgenic;: HO- 
• 20 transgenic for GHR/BP Knock out) : 

G9: NDNT<DBNT=DBHO=NDHO (SEQ ID, NO: 29) 
Cytochrome oxidase III 

25 G16: NDNT=DBNT<DBHO<NDHO (SEQ ID NO: 30) 
Hypothetical protein 

G24: DBNT>NDNT=DBHO=NDHO (SEQ ID NO: 31) 
Homolog to ERG2 protein 

30 

G28: NDNT=DBNT>DBHO=NDHO (SEQ ID NO: 32) 
Glutathione peroxidase 3 

Example 2 

35 Differential Expression of Genes in Kidneys of Nontransaenic 
Mice as a result of Streotozotocin Treatment 

End stage renal disease (ESRD) is a costly outcome of 
diabetes, yet it is a difficult complication to prevent. 



wo 2004/013289 



PCTAJS2003/024053 



90 

Factors that influence its development include heredity, 
blood glucose levels, and blood pressure. ESRD is a 
progressive disease that can take many years to develop. It 
can sometimes be treated, and even reversed, if caught at an 
early stage. Thus, an understanding of the molecular basis 
for the development and progression of diabetic nephropathy 
culminating at ESRD is crucial for the diagnosis, treatment 
and prevention of this debilitating disease. 

'Streptozotocin-induced diabetes in mice provides an 
excellent model system for studying the development of 
insulin-dependent diabetes mellitus (IDDM) nephropathy, at . 
both the physiological and molecular level. We, and others, 
have used this model system to examine the role of specific 
genes or gene products in the development of diabetic 
nephropathy leading to glomerulosclerosis. While 
significant advances have been made, much information is 
still lacking. In an attempt to further our knowledge of 
the molecular mechanisms resulting in diabetic nephropathy, 
we are using a polymerase chain reaction (PCR) -based cDNA 
subtraction method to compare populations of mRNAs expressed 
in kidneys of streptozotocin-induced diabetic mice at 
various developmental stages of the disease with populations 
of mRNAs expressed in healthy kidneys by creating libraries 
containing virtually all of the gene products (cDNAs; novel 
and known) involved in the progression of 
glomerulosclerosis. Nephropathy-correlated genes, thus 
.identified, may be used as molecular markers for disease 
progression and as therapeutic drugs or drug targets. 

In this example, subtractive hybridization was used to 
identify genes differentially expressed in diabetic (STZ- 
treated) and control mice. These mice, unlike those of 
Example 1, were all nontransgenic. 

Several partial cDNA clones were identified whose 
altered expression ds associated with kidney damage. The 
protocol was the same as for Example 1, except as set forth 
below. 

RNA isolation 
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Total RNA was isolated from kidneys of wildtype female 
mice after 4 or 10 weeks of diabetes induced by 
streptozotocin, as well as similarly aged non-diabetic 
wildtype controls, .using the RNA STAT- 60 Total RNA/mRNA 
Isolation Reagent . according to the manufacturer's 
instructions (Tel-Test, Friendswood, TX) . 

cDNA subtraction libraries 

O^ie library (E) included clones up-regulated in . . 

diabetic mice and the other library (F) included clones 
down- regulated in diabetic mi.ce. 

Northern analysis 

Positive clones^ identified by the differential 
hybridization screen, were used as probes in Northern 
hybridization analyses to confirm their differential 
expression. Total RNA isolated from the kidneys of female * 
mice after 10 weeks of diabetes induced by streptozotocin, 
as well as non-diabetic controls, was resolved by agarose 
gel electrophoresis through a 1% agarose, 1% formaldehyde 
denaturing gel, transferred to positively charged nylon 
membrane, hybridized in DIG Easy Hyb with the cDNA insert 
labeled by asymmetric PGR with Digoxigenin, and analyzed 
using anti-Digoxigenin-alkaline phosphatase conjugate and 
CDP-Star according to the manufacturer's instructions (Roche 
Diagnostics Corp., Indianapolis, IN). 

CHARACTERIZATrON OP CLONES. ' 
Clone FX 

The results of a sequ'ence search for this clone are 
discussed in Example 1. Besides being up-r^gulated in 
protected (DB -/-) kidney, it was down-regulated in diabetic 
+/+ kidney 



ND+/+»DB+/+ 



Clone F4 

Insert size: 616 bp query, sequence 616 bp (SEQ ID 

N0:6) 
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Nucleotide database search 

Blast N: Matches with ref. NM_010594.1 Mus musculus 
kidney androgen regulated protein (Kap) inRNA 



612 with subject 3-606) 

With H- sapiens FKSG22 mRNA (AF319957 . 1) , the alignment 
yielded a score of 1098 bits (E==0.0) for query 33-612 with 
subject 1-580 (576/581 [99%] identities). 

Protein database search 

Blast X: Matches with ref. NP_034724.1 (NM 010594) Mus 
musculus kidney androgen regulated protein and h\aman 
AAH08576.1 kidney androgen regulated protein. 



54-416 with subject 1-121) 

Northern analysis: Down-regulated in diabetic kidney 



Of the human sequence^ clone F4 (and F6) is most similar to 
FKSG22 mRNA (gi:12276125) / implicated in ovarian cancer, and 
kidney androgen regulated protein mRNA (gi: 14250295) ; e=0. 
Both mRNAs encode the protein, KAP. 

The kidney androgen-regulated protein (KAP) is encoded 
by a relatively abundant mRNA (4% of poly (A) RNA in male 
mouse kidney) . The concentration of KAP xoRNA is 3-5 fold 
higher in males, or in testosterone-treated females, than in 
untreated females. It appears to be expressed in the 
epithelial cells of the proximal convoluted tubules, in 
particular by cells of the SI, 82 and S3 segments. Several 
possible hormone-responsive elements have been identified in 
the 5 '-flanking region. The KAP gene is a single copy gene, 
but seems to be highly polymorphic. See generally Niu, et 
al., DNA & Cell Biol., 10:41-8 (1991). 

For KAP, see also Homo sapiens, kidney androgen 
regulated protein, clone MC6:18182, IMAGE: 4155347 , mRNA, 
complete cds. Accession #BC008576, version #BC008576.1, GI : 
14250295. 



E Value=0.0 (1154 bits) (99% identity of query 9- 



E value=le-61 (235 bits) (99% identity of query 



ND+/+»>DB+/+ 
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Clone F5 (SEQ XD NO: 7) 

Insert size: -1000 bp, query sequence 881 bp, SEQ ID 

NO: 6 

Nucleotide database search 

Blast N: Top matches are with (1) gi20881938, ref 
XM_127876.1 ^^Mus musculus similar to data source: SPTR, 
source key:Q9Z0S6, evidence :ISS~putative~similar to CLAUDIN- 
10" (score 1233 bits E 0.0), (2) gi 21617848, ref 
NM_0238T8.1, ^^Mus musculus claudin 10 (CldnlO) (score 125 
bits E 0.0)", and : (3) . gil2860620, dbj AK020131 "^Mus.. musculus 
:12 days embryo male wolffian duct includes surrounding 
region cDNA; RIKEN full-length enriched library", (1215 bits 
E 0.0), and several other claudin-lO-related sequences (gb 
•BC021770.1, dbj AK055855.1, BC029019.1, ref NM_021386.1, gb 
AF124425.1), 

The highest-ranked hiaman sequence was gi 16550687, dbj 
AK055855.1, ^^Homo sapiens cDNA FLJ31293 fis, clone 
KIDNE2007569, moderately similar to claudin 10." See also 
gi 16444694, emb AL357061.19, and gi 17475396, ref 
XM_007076.4. 

The cognate human claudin-10 sequence is ref. 
NM_006984.1 (165 bits, E=le-37) or Genbank U89916. See also 
gi 17475396, ref XM_007076.4, and gil5012044, Genbank 
BC0109.20. 

Protein Database search 

Blast X: Best matches were with (1) gi 20881939, ref 
XP_12787 6.1, "'similar to data source :SPTR, source key: 
Q9Z0S6, evidence :ISS~putative~similar to CLAUDIN-10 [Mus 
musculus]" (236 bits, E=le-74) ; (2) gi 21617849, ref 
NP_07 6367 .1, "claudin 10" (234 bits E=3e-74), (3) gi 
16550688, dbj BAB71030.1, AK055855, "unnamed protein product 
[Homo sapiens]" (225 bits, E=4e-7l) ; and (4) gi 18256029, gb 
AAH21770.1, "similar to RIKEN cDNA 6720456116 gene [Mus 
mussculus]" (236 bits, E=le-68) .' 

The cognate human protein sequence is ref NP_008915.1 (E'=6e- 
40) . 
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Northern analysis: Down-regulated in diabetic kidney 
ND+/+>DB+/+ 

Claudins are integral membrane proteins localized as 
tight junctions^ see Kubota, et al., Curr. Biol. 9:1035-8 
(1999); Monta, et al., J. Cell. Biol., 145:579-88 (1999); 
Monta, et al,, Proc. Nat. Acad. Sci. (USA), 96:511-16 
(1999). 

-Clone F6 (SEQ ID NO: 8) 

This is a slightly shorter version (611 bp) of clone 
F4.. • 

Clone £39 

Insert size 981 bp* (SEQ ID N0s:15 and 16) 

.Blast N Similar to gb U65535.1 mouse immunoglobulin 

kappa-chain mRNA, 34 6 bits 2e-92 

The best human match was to gbS65921.1, anti-colorectal 
carcinoma light chain^glycoprotein CANAG-50 specific. IgGl 
kappa 338 bits, 4e-90; See Schable, et al., Eur- J. 
Immunol.^ 29:2082-6 (1999). 

Blast X 

Human protein cognate: 1FH5L, pdb 1FH5 .Chain L crystal 
structure of the Fab Fragment of the Monoclonal Antibody Mak 
33, 93 bits 5e-19. 

See also Mouse gb AAC04542.1, monoclonal antibody kappa 
light chain [mus musculus] 61 bits', . 8e-16 . 

Northern analysis: Up-regulated in diabetic kidney 
(DB+/+»ND+/+) and damaged bGH kidney (bGH>NT) . 

Clone F27 

--1000 bp insett 820 bp query sequence (SEQ ID NO: 17) 
End 337 nt similar to ref NM 013559.1 mus musculus heat 
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shock protein, 105 kDa (Hsp 105) 
Blast N mRNA/ 3387 nt 

Identities=329/338 (97%), E«e-173, 607 bits 
•ref XM_036358.1 Homo sapiens heat shock 105 kD (HSP105B) 
mRNA, 220 bits, 7e-55, 83%. 

dbj BAA11035.1 (D67016) heat shock protein 105KD alpha [mus 
musculus] , 182 bits, 3e-45, 75%. 

Human protein cognate: 

Blast X. Heat shock lOSkD. [homo . sapiens] ref XP:.036357.1 
Identities=95/136 (69%), le-42 
Rest of sequence is novel 

Northern analysis: Down-regulated in diabetic kidney 
(ND+/+>DB+/+) 



The top matches in the nucleic acid alignment were 

Qil7305168lref INM_013559. II Mus niusculus heat shock protein.., 607 e-173 

. aiJ840651lablL4Q406,llMaSHSPE Mus muscuLus heat shock prote... 607 e-173 

aill001008)dbilD67016,llD67016 Mus wagn^ri itiRNA for heat sh. . . S&l e-173 

a±ll0G1010ldbTld67Q17,lld67017 Mus wagneri mRNA for heat sh. . . 529 e-171 

For the top match, NM_.013559, Mus- musculus heat shock 
protein 105 kDa, query 1-337 aligned to subject 894-1231 
with 329/338 (97%)' identities and 1 gap. 

The' best NA alignment to a human sequence wais of query 
1-328 to XM_036358.1/ Homo sapiens heat shock 105 kD, 
subject 797-1125; with 274/329 identities (83%) and 1 gap. 

For the amino acid sequence corresponding to 
NM_013559-1, that is, the Mus musculus 105 kDa heat shock ' 
protein. See Morozov, et al., FEBS Lett., 371 (3) :214-18 
(1995), and has the ID of NP_038587.1 (which refers to it as 
a ^^110 kDa'' protein) . 



In like manner, additional clones have been identified, 
with the following Northern analyses (ND, nondiabetic; DB, 
diabetic) : 
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P2: ND>DB; Far upstream element (FUSE) binding protein 1 

(SEQ ID NO:23) 
PI 6: ND>DB; Unknown (SEQ ID NO: 34) 
F21: ND<DB; Unknown (SEQ ID NOS:24, 25) 

F38.: ND>DB; Palmitylated serine/ threonine kinase (SEQ ID 
NO:26) 

P39: ND>DB; Phosphotriesterase related protein (SEQ ID NO:27) 
P40: ND<DB; Similar to mouse testicular tumor differentially- 
expressed gene (TDE) 1 (SEQ ID NO:28) 

FUSE binding protein 1 is discussed in Duncan, et al.^ 
Genes & Development, 8:465-80 (1994). The c-myc proto- 
oncogene stimulated by it is involved in normal cell 
proliferation and programmed cell death. The FUSE is 
required for maximal transcription of c-myc. FBP stimulates 
expression of c-myc in a FUSE-dependent manner and is 
believed to be involved in regulation of c-myc. 

A 305-aa embryo-derived serine /threonine kinase • 
(EDPK) , which we have identified with clone F38, was 
isolated by Kurioka, et al., Biochim. Biophys. Acta, 14 43: 
275-84 (1998) . It is believed to play a role in 
intracellular signaling during embryogenesis . Its catalytic 
domain is 271 a, a- Another serine/threonine kinase, Krct, is 
described in Stairs, et al.. Human Molec- Genet., 7: 2157-66 



A mouse cDNA {mpr56-l) with .homology to a prokaryotic 
parathion hydrolase (phosphotriesterase) -encoding gene is 
discussed in .Hou, et al . , Gene, 168: 157-63 (1996). This 
gene is expressed primarily in liver, and post-natally . .It 
is underexpressed in cystic kidneys. Expression is 
decreased upon acute renal injury induced by a single 
intraperitoneal injection of folic acid. The protein is 349 
a.a. There is greater than 50% a. a. similarity to the 
procaryotic prathion hydrolase. 

The human TDE homologue is analyzed in Bossolasco, et 
al., Molec. Carcinogenesis, 26: 189-200 (1999). It is 78% 
homologous to the mouse TDE amino acid sequence. 



(1998) . 
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pxample 3 

Different i al Expression of Genes in Kidneys D^ice a, 
Result of Overproduction of G rowth Hormone 

In this example, genes were identifieti which were 
differentially expressed in (1) transgenic mice with kidney 
damage as a result of overexpression of bovine growth 
hormone, as compared to (2) nontransgenic control mice. 

A PGR based cDNA subtraction strategy was used to 
create libraries cont;aining genes potentially involved in 
the progressive development of nephropathy. Total RNA was 
isolated from the kidneys .of bGH- and nontransgenic (NT), 
control mice at 2 months, 5 months, and 12 months of age and 
used to create the two cDNA libraries for each time point. . 
The bGH libraries at each time poi.nt consisted of cDNAs 
upregulated in the bGH a^ compared to NT mice. The NT- 
libraries consisted of cDNAs down-regulated in. the bGH as 
compared to NT. mice (i.e., upregulated in the NT as compared 
to the bGH mice) . Subsequently, cDNA clones from each 
library were spotted on replicate membranes. Dif f eren.tial 
hybridization of the two membranes, one probed with the bGH 
library and the other probed with the NT library, was used 
to select differentially expressed clones. Dif f erentiell 
hybridization of clones from the two libraries at 2 months 
of age showed no differential expression. At 5 months of 
age a number- of differentially expressed clones were 
observed. At 12 months of age an even larger number of 
differentially expressed clones were seen. Thus far. 
Northern Analysis has revealed ,that several clones from the 
12 month bGH library are highly expressed in the kidneys of 
bGH mice as compared to NT controls. Furthermore, their 
expression increases with time, correlating with development 
of glomerulosclerosis. 

The protocols were the same as for Example 1, with the 
following exceptions . 



Mice 

The mice had a B6/SJL genetic background. Inserted in 
their genome was the bGH cDNA clone whose expression was 
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driven by the mouse metallothionein I transcriptional 
regulatory sequences. 

RNA isolation 

Total RNA was isolated from kidneys of 2^ 5 and 12 
month old female bovine growth hormone transgenic (bGH) mice 
and non-transgenic controls using the RNA STAT- 60 Total" 
RNA/mRNA Isolation Reagent according. to the manufacturer's 
instructions (Tel-Test, Friendswood, TX) . . 

cDNA Subtraction Libraries 

The forward (.bGH) subtracted library included clones 
up-regulated in bGH mice and the reverse (NT) library 
included clones down regulated in bGH mice. 

Screening by differential hybridization 

cDNA arrays of clones from the forward and reverse 
subtracted libraries at each time point {2, 5, and 12 months 
of age) were screened with. probes made from each library of 
the corresponding time point using the PCR-Select 
Differential Screening Kit according to the manufacturer's 
instructions (CLONTECH, Palo Alto, CA) . 

Northern Analysis 

Positive clones, identified by the differential 
hybridizatioii screen, were used as probes in Northern 
hybridization analyses to confirm their differential 
expression. Total RNA isolated from the kidneys of 2, 5 and 
12 months of age female bGH and non-trarisgenic mice was 
resolved by agarose gel electrophoresis through a 1% ' 
agarose, 1% formaldehyde denaturing gel, transferred to 
positively charged nylon membrane, hybridized in DIG Easy 
Hyb with the cDNA insert labeled by asymmetric PGR with 
Digoxigenin, and analyzed using anti-Digoxigenin-alkaline 
phosphatase conjugate and CDP-Star according to the 
manufacturer's instructions (Roche Diagnostics Corp-, 
Indianapolis, IN) • 
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CHARACTERIZATION OF CLONES 

Genes up regulated in the 2, 5 or 12-month bGH 
libraries are considered unfavorable, and those up-regulated 
in the corresponding NT libraries are considered favorable, 
i.e., likely to encode a protective protein. 

Clone A8 (12 month bGH .library) 

..Insert 334 bp, query sequence 334 bp (SEQ ID NO: 9) 
By BlastN, the highest similarity was to M21796.1; 
Mouse (M- spretus) Ig germline Kappa chain gene C-region, 
exon, with a score of 640 bits (E=0.0), and 99% identity of 
query 8-334 with subject 1287-16JL3. Highest similarity to 
human clone was S65921,l anticolorectal carcinoma light 
chain = glycoprotein CANAG-50 specific IgGi kappa (57 9 bits, 
97% identity) e-163. 

By BlastX, there were a very large niimber of sequences- 
with the same score (8 6 bits) and E* value {9e-17) . Almost 
all were immunoglobulin genes. .The best matching human 
clone was AAB28160 . 1 (S65921) anti-colorectal carcinoma light 
chain (86 bits', 9*7% identity 9e-17) . 

Northern analysis: Expression in kidneys of bGH— 
transgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 
Highly expressed in spleen, lung, intestine, stomach, 
kidney, and omental fat; very low levels in liver, heart, 
gastrocnemius, and brain of nontransgenic mouse. 

Clone A34 (12 month bGH library) 

Insert size: 503 bp, query 503 bp (SEQ ID NO: 10) . 

By BlastN, this has a score of 636 bits (E=0.0) for a* 
99% identical alignment of query 139-463 to subject 1323- 
999, the subject being emb X03690.1, M. musculus mRNA for Ig 
heavy chain constant region mu(b) allele. The best human 
match was to AF144029.1 Homo sapiens MDM2 gene, intron 9 and 
exon 10, partial sequence (208 bits, 2e-51, 99% identity). 

By blastX, the score is 224 bits (E=3e-58) for a 99% 
identical alignment to pir A24976, Ig mu Chain C region 
(allele b) of the mouse. The best human alignment was to 
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AAH01872.1 Unknown (191 bits, 4e-49) , 84% identity) as well 
as many others. 

Northern analysis: Expression in kidneys of bGH- 
tfansgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 

Clone il39 (12 month 1>6H library) 

Insert Size; 482 bpr query sequence 482 bp 
. (SEQ ID N0:11) 

. By BlastN the highest score (879 bits, E«0.0) was with . 
BCQ1Q327.1, Mus musculus,. clone MGC:6533, image : 265177 6, 
435/457 identity (99%) of query 26-482 with subject * 425-881 . 
Next best (7 82 bits) was a mouse mRNA for ♦gamma-2b 
immunoglobulin (emb V00799.1). The best human match was 
S657 61, anti-colorectal carcinoma heavy chain glycoprotein 
CANAG-50 specific IgGl kappa antibody. The score was 300 
bits (E=5e-79), with 212/233 . (90%) identity aligning query 
133-365 to subject 513-^745. 

By BlastX BC010327 was again the highest scorer (233 
bits, E=4e-61), with 117/154 (75% identity) in query 19-480 
vs. subject 122-275- The next best score was to AAA 51630-1 
Ig H-chain (v-region from MPCll) of mus musculus (226 bits, 
8e-59, 75% identity) . Arid the highest h\iman scorer was 
AAB28159.1, corresponding to S65761. The observed identity 
was 87/157 (55%) . 

For information on gamma 2b transgenic mice, see Storb, 
et al., Immunol. Res., 13:291-8 (1994), and Ann. NY Acad. 
Sci. 546:51-6 (1988) . 

Northern analysis: Expression in kidneys of bGH- .. 
transgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 



Clone A48 (12 month b6H library) 

Insert 430 bp, query sequence 430 bp (SEQ ID NO: 12) 
By BlastN^ several sequences produced substantially 

equally significant alignment (E value 0.0) : 

BC008237.1 Mus musculus, similar to immunoglobulin 

heavy chain 4 (serxam IgGl)-, clone MGC:6522 IMAGE: Z651217 



wo 2004/013289 



PCT/US2003/0240S3 



101 



mRNA 

BC002121.1 mus musculus similar to immunoglobulin heavy 
chain 4 (serum IgGl) / clone mGC:6628 IMAGE:3491766, mRNA 



L35252.1 mouse germ line immunoglobulin gamma-1 (Ighb) 

5 mRNA 

Best human match was S65761.1 anticolorectal carcinoma 
heavy-chain=glycoprotein CANAG-50 specific IgGl kappa 
antibody (684 bits, e=0.0, 97%. identity). 
Note that' these were all Ig-related. 
10 By BlastX, the best score was for Genbank AAB59665 

• (M60434), score 254, E=le-67. This is an Ig heavy chain 
constant region from Mus domesticus. 116/122 95% identity of 
query 3-368 to subject 227-348. The best human alignment 
was to S657 61 anti-colorectal carcinoma (see above) (204 
15 bits, 3e-53, 94% identity) . 

Northern analysis: Expression in kidneys of bGH- 
transgenic mice increases with age and is greater than 
levels seen in age-matched nontransgenic control mice. 

20 Clone B45 (12 month NT library) 

Insert Size: 610 bp, query sequence 610 bp (SEQ ID 
NO: 13) 

By BlastN/ the highest scoring siiailarity was with ref 
NM_010594.1, Mus musculus kidney androgen regulated protein 

25 mRNA (1170 bits, E=0-0), for 99% identity. of query 9-610 
with subject 4-605- Note also the high score for the 
closely related sequence M22810.1 (1168 bits). Best hioman 
match was to AF319957 Homo sapiens FKSG22 mRNA (1132 bits, 
E=0.0, 99% identity). 

30 By BlastX, the highest score was for ref NP_034724.1 

and (pl5267) kidney androgen regulated protein (238 bits, E 
of 3e-62), for 100% identity of query 53-415 with subject 1- 
121. The rat homologue, ref NP_434689.1, scored 103 bits, 
E=le-21. AAG50272.1 corresponds to the best human BLASTN 

35 match, Homo sapiens FKSG22 mRNA. 

Northern analysis: Expression in kidneys of bGH- 
transgenic mice is less than levels see in age-matched 
nontransgenic control mice and decreases with age. 
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Clone C22 (5 month b6H IdLbrary) 

query sequence 731 bp (SEQ ID NO: 14) 

By Blastn, the best alignment is to Genbank 
BC016887.1 Mus musculus clone MGC:18401 IMAGE : 4242174 iiiRNA 
5 and Genbank AF260580.1, Mus musculus disabled-2 p96 (Dab2) 

gene, score of 1243 bits (E 0.0) ^ for 97% alignment of: query 
1-729 to subject 28489-27769- The best human alignment (58 
bits) is to the human homologue^ Genbank AF2058 90.1 (AFX 
21883932)^ and- AG008846 . 7,, •E=7e-06, 43/47 identities. . 
10 No significant Blas't X results, probably in 3' 

noncoding region. 

• • Either this human sequence, or the much longer mouse 
sequence, could be used as a probe to isolate the full 
length human DNA. 

15 Dab2 is a widely expressed relative of Dabl/ a neuron- 

specific signal transduction protein that binds to and 
receives signals from members of the* low-density lipoprotein 
receptor family. Dab2 contains a phosphotyrosine-binding 
domain which binds peptides containing the sequence FXN-PXY. 

20 Dab2 may therefore be both a signal transduction protein and 
an adaptor protein that regulates protein trafficking- See 
Morris, ^^Disabled-2 colocalizes with the CDLR in clathrin- 
coated pits and interacts with AP-2", Traffic, 2(2):lll-23 
(2001) . It also may be a transcriptional activator, see 

25 Cho, et al., Biochem. J., 352:645-50 (2000). 

This clone was identified as being up-regulated in 
kidneys of 5 month old bGH mice as compared to age-matched 
nontransgenic control mice, but this differential expression 
has not been confirmed by Northern analysis. 

30 . 

In like manner, additional clones have been identified, 
with Northern analyses (NT, nontransgenic; bGH, transgenic 
for bGH) given below: 

35 B3: NT>bGH; Unknown (SEQ ID NOs:19, 20) 



B46: NT>bGH; NAS-hypothetical protein-putative (SEQ ID 
NO:33) 
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Exanqple 201 

Use o£ Somatic Gene Therapy with KAP xsSJSIA to Protect 
Against Diabetic Nephropathy. 

Specific Aims 

Kidney damage- is a frequent complication of both type I 
and type II diabetes/ often ending in kidney failure, or. 
end-stage renal disease (ESRD) . Diabetic nephropathy, the 
single itlost common, cause of ESRD, is a progressive disease 
that takes several* years to develop and often goes 
undiagnosed. Our long-term goal is to design specific, 
targeted markers* and therapeutic approaches for the 
diagnosis, treatment, and prevention of human diabetic 
kidney disease. Toward this end, we have identified a gene, 
encoding kidney androgen-regulated protein (KAP) , whose 
expression decreases markedly with increasing kidney damage, 
in both a diabetes-dependent and a diabetes-independent . 
mouse model of glomerulosclerosis. We believe that this 
.gene, through manipulation of its expression, could provide 
a means for protecting the kidney from damage. This has 
tremendous therapeutic potential since KAP is a relatively 
small peptide, is possibly secretecl, and appears to have a 
human homologue. Based oh our findings, we propose the 
following hypothesis and specific aims - 

Hypothesis: Maintenance of high levels of KAP 
expression will protect the kidney from glomerular 
hypertrophy and diabetic damage. We have observed high 
levels of KAP mRNA isolated from kidneys of nondiabetic 
female mice. In comparison, KAP mRNA levels are 
dramatically decreased in kidneys of streptozotocin (STZ)- 
induced diabetic mice. Furthermore, BCAP mRNA levels 
progressively decline with age in nondiabetic bovine growth 
hormone (bGH) -transgenic mice. These mice exhibit 
increasingly severe glomerular hypertrophy and eventually 
glomerulosclerosis, hallmarks of the early stages of human 
diabetic nephropathy. Since low levels of KAP mRNA are 
associated with kidney damage,, we hypothesize that 
maintenance of high levels of KAP expression will protect 
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the kidney from damage. We will test this hypothesis by 
expressing KAP in the kidney using a heterologous promoter 
and then assessing kidney damage after induction of 
diabetes. 

Specific Aim 1: To constitutively express KAP in the 
mouse kidney. KAP expression is normally restricted to 
proximal renal tubule cells and is under the control of 
several different hormones. ' In order to obtain constitutive 
expression, of KAP, the full-length mouse cDNA encoding KAP 
will.be placed under the • direction of the kidney-specific 
mouse gamma-glutamyl transpeptidase (GGT) type II promoter • 
and transferred into mouse embryos by pronuclear 
microinjection. Individual .lines will be established from 
several founding animals expressing the KAP transgene. In 
anticipation that expression from .the kidney-specific 
promoter might be^lowj additional mouse- lines will be 
generated with the KAP cDNA placed Under the direction of 
the stronger mouse metallothionein-I (MT) promoter that 
expresses in several other tissues in addition to the . 
proximal tubule3. 

Specific Aim 2: To induce diabetes in the KAP- 
transgenic mice and assess the degree of kidney damage. To 
test whether KAP expression can prevent diabetic 
nephropathy, diabetes will be induced in KAP-transgenic mice 
as well as nontransgenib control mice by injection of 
streptozotocin. Glomerular hypertrophy, mesangial cell 
expansion and urinary albumin excretion (UAE) will be 
assessed following a period of hyperglycemia. Observance of 
kidney measurements for the diabetic KAP-transgenic mice 
that more closely resemble those of nondiabetic rather than 
diabetic nontransgenic controls will support the hypothesis 
that maintenance of KAP expression will protect the kidney 
from diabetic damage. 

Characterization of the kidney androgen-regulated protein 
gene 

Expression of the KAP gene in mouse kidney is under the 
complex control of several hormones, including androgen. 
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estrogen, thyroid hormone and the growth hormone/insulin- 
like growth factor-I (GH/IGF-I) axis (Cebrian et al., 2001; 
Meseguer and Catterall, 1990; Meseguer and Catterall/ 1992) . 
Expression is confined almost exclusively to epithelial 
cells of the renal proximal tubule (Meseguer and Catterall, 
1987). Expression in the cortical SI and S2 segments is 
androgen-dependent while expression in the medullar S3 
segment is androgen-independent (Meseguer and Catterall, 
1990; Meseguer et al., 1989). Expression of KAP also occurs 
..in -the mouse uterus^ but only during the perinatal period ' 
(Cebrian et al . , 2001; Kasik and Rice, 1993). 

The KAP gene contains four exons and three intervening 
sequences that* span 3807 nucleotides of the mouse genome 
(Niu et al./ 1991) . Its mRNA is .611 nucleotides in length*. 
- (Niu et al., 1991). KAP gene* regulation occurs primarily at 
the level of transcription with protein expression closely 
paralleling mRNA expression. The 121 amino acid protein 
product of KAP migrates with an apparent molecular mass of 
20 kDa (Cebrian et al., 2001; Meseguer et al., 1989).. A 
hydrophobic N-terminal domain forms a putative 18 amino acid 
signal peptide (Meseguer et al., 1989). The remainder of 
the protein is highly negatively charged and contains sunino 
acid clusters similar to those associated with proteins with 
short half-lives (Meseguer et al., 1989). 
Immunohistochemical assays of mouse kidney sections 
identified the protein in the same tissues and at the same 
•relative levels as the mRNA (Cebrian et al . , 2001). 
Although the function of KAP remains elusive, it has 
recently been determined that the protein specifically 
interacts with cyclophilin B and that overexpression of ECAP 
in stably transfected proximal tubule cells significantly 
decreases the toxic effects of cyclosporine A (Cebrian et 
al., 2001). 

Importance 

Up to 15.7 million Americans, or approximately 5.9% of 
the population, are thought to have diabetes. While two- 
thirds have been diagnosed, one-third are not aware that 
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they have the disease. Many people first become aware that 
they have diabetes when they develop one of its 
complications/ such as kidney disease, heart disease, nerve 
disease or blindness. Ten to twenty-one percent of all 
5 people with diabetes eventually develop kidney disease. 
Diabetic nephropathy is the most common cause of ESRD/ a 
condition that requires dialysis or a kidney transplant for 
survival. Approximately 40% of the new cases of ESRD are 
attributable to diabetes, and this incidence iis increasing 

10 more rapidly than any other cause. 

Amino acid sequence analysis suggests that KAP may be a- 
small, secreted protein (Meseguer et al-, 1989). In 
addition, a human cDNAhas been identified that is 99% 
identical to the mouse cDNA at the nucleotide level and 100% 

15 identical at the amino acid level. If shown to have 

protective abilities, KAP could be utilized as a therapeutic 
drug for the prevention of diabetic. nephropathy in humans. 
Preliminary Studies 

We have utilized two different mouse models that 

20 • .develop kidney damage by presumably independent means, the 
streptozotocin. (STZ) -induced type I diabetic mouse and the • 
MT/bGH-transgenic mouse, A PCR-based method of subtractive 
hybridization was used to create libraries that contained 
cDNAs either up-regulated or down-regulated in the damaged 

25 kidney in comparison to the undamaged control kidney. A 

cDNA encoding KAP was identified in the libraries of down- 
regulated cDNAs from both mouse models of kidney damage. 
Data describing the assessment of kidney damage and showing 
the down-regulation of the KAP gene in the two mouse models 

30 are presented below. 

Analysis of kidney damage in streptozotocin-induced diabetic 
mice 

Diabetes was induced by multiple injection of STZ as 
35 described by Bellush et al. (2000). In short, 2-month-old 

female 129 Ola/BalbC mice received daily injections of 85 mg 
STZ/kg body weight until their blood glucose levels were at 
least 250 mg/dl . Control mice received a similar number of 
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injections of citrate buffer without the STZ. Within one 
week, most of the STZ-injected (DB) mice had blood glucose 
levels of 300-500 mg/dl that were maintained throughout the 
duration of the study. The blood glucose levels of 
nondiabetic (ND) control mice remained below 100 mg/dl. 
Mice were sacrificed 4 or 10 weeks after the onset of 
diabetes. Several parameters reflecting the extent of 
kidney damage were. assessed. Glomerular hypertrophy was 
observed in the DB mice in comparison with the ND mice at 
both the 4 and 10 wk timepoints. (Fig. lA) A significant 
elevation of urinary albumin excretion (UAE) was observed 
only at the 10 wk timepoint. (Fig. IB) Both changes are 
indicative of kidney damage. Measurements of mesangial cell 
proliferation, another indicator of damage, are in progress. 



Decreased KAP mRNA expression in diabetic kidneys • 

In an effort to identify genes involved in the 
progression of. diabetic nephropathy, we performed a PCR- 
based method of subtractive . hybridization using RNA 
extracted from kidneys of streptozotocin- induced (type I) 
diabetic female mice and nondiabetic .controls collected 10 
weeks after the establishment of diabetes. Two cDNA 
subtraction libraries were created, one (DB) containing 
cDNAs up-regulated in the diabetic kidney in comparison to 
the, nondiabetic kidney, and one (ND) containing cDNAs down- 
regulated in the diabetic kidney. cDNAs exhibiting high 
sequence similarity to the KAP nucleotide sequence were 
isolated multiple times from the library of down-regulated 
cDNAs. Northern analysis confirmed the decreased expression 
in the diabetic kidney. 

Analysis of kidney damage in MT/bGH-transgenic mice 

Expression of. the bovine growth hormone (bGH) cDNA 
under the direction of the metallothionein-I promoter (MT) 
results in a giant mouse that is approximately 1.5-2 times 
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the size of nontransgenic mice (Chen et al,, 1990; Quaife et 
al.r 1989). These MT/bGH mice develop severe kidney damage 
and die prematurely (Quaife et al., 1989). We have assessed 
several parameters reflecting the extent of kidney dainage in 
these mice as a function of age. Significant increases in 
glomerular volume (as determined by Analysis of Variance 
(ANOVA) ; P<0.05) were observed in the MT/bGH mice in 
comparison with the nontransgenic. (NT) mice at the 2 and 5 
month timepoints but not at 12 months (Fig. 2A) . A 
significant increase in mesangial cell area 'was observed 
only at the 12 month timepoint (Fig. 2 B) . A significant 
elevation of urinary albumin excretion (UAE) was also 
observed only at the 12 month timepoint (Fig. 2C) . Light 
microscopic images of NT and bGH kidney sections show 
glomerular hypertrophy in the bGH kidney sample along with a 
thickened Bowman's capsule. The observed changes seen in 
the bGH mice are indicative of kidney damage. 

Decreased KAP mRNA expression in the kidneys of MT/bGH- 
transgenic mice 

In an effort to identify genes involved in the kidney 
damage resulting from expression of the bGH transgene, we 
performed a PCR-based method of subtractive hybridization 
using RNA extracted from kidneys of 12-month-old MT/bGH- 
transgenic mice and their nontransgenic littermates. Two 
cDNA subtraction libraries were created, one (bGH) 
containing cDNAs up-regulated in the MT/bGH kidney in 
comparison to the nontransgenic kidney, and one (NT) 
containing. cDNAs down-regulated in the MT/bGH kidney.^ A 
cDNA exhibiting high sequence similarity to the KAP 
hucleotide sequence was isolated from the library of down- 
regulated cDNAs. Northern analysis confirmed the decreased 
expression, in the MT/bGH kidney as a function of age that 
correlated with degree of kidney damage. 

Research Design and Methods 

We have demonstrated that KAP mRNA expression is 
inversely correlated with kidney damage (i.e. KAP expression 
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decreases with increasing kidney damage) . This decrease in 
KAP gene expression could either be the cause of the kidney 
damage or could result from the kidney damage. One way to 
test the former possibility would be to knock out expression 
of the KAP gene and assess whether kidney damage occurs 
independently of diabetes. In a converse experiment^ high 
levels of KAP expression could be maintained and the kidneys 
assessed for protection from damage when diabetes is 
induced. The second experiment would further • support KAP as 
a therapeutic drug for the prevention of diabetic 
nephropathy if shown to be protective while the* first 
experiment would further support KAP as a drug target for 
the prevention of kidney damage. Due to the practical, 
outcome of the second experiment, we propose to test the 
hypothesis that maintenance of high levels of KAP expression 
will protect the kidney from glomerular hypertrophy and 
diabetic damage. To do this, we will create a transgenic 
mouse line that expresses KAP in the kidney under the 
direction of a heterologous promoter such that KAP 
expression will no longer decrease with increasing kidney 
damage. We will then challenge the kidney by inducing 
diabetes in the KAP-transgenic mouse and assessing the 
degree of kidney damage. The hypothesis will be considered 
proven if the kidneys of diabetic KAP-transgenic mice more 
closely resemble nondiabetic rather than diabetic 
nontransgenic control kidneys. Confirmation of the 
hypothesis would further support of KAP as a therapeutic 
drug to prevent diabetic nephropathy. 

Specific Aim 1: To constitutively express KAP mRNA in the 
mouse kidney 

Rationale 

It has been demonstrated that KAP mRNA levels decrease 
with increasing kidney damage. In order to prevent this 
decrease in expression, the KAP cDNA will be placed under 
the control of a heterologous promoter that should not be 
affected by the state of diabetes (use of the term 
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^^promoter" in this proposal includes the region of the gene 
required to initiate transcription as well as regions that 
•regulate expression) . Although a number of genes are 
expressed in the renal proximal tubule, very few have been 
5 shown to be expressed only in those cells. Furthermore, 
very few of the promoters of these genes have been 
characterized well enough to demonstrate cell-specific 
expression of a reporter gene. A search of the literature 
has indicated two different promoter regions that may 
..10 provide suitable transcriptional ' regulation. The first, a 
34 6-base pair region from the type II promoter of the mouse 
gene encoding gamma-glutamyl transpeptidase (GGT) , has been 
shown to direct expression of a B-galactosidase reporter 
gene specifically in the renal proximal tubules of 

15 transgenic mice (Sepulveda et al., 1997). Although the GGT 
-promoter has ^ the*^ correct tissue-specific expression, there 
is some question as to whether the level of expression will 
be adequate. The second promoter selected is from the mouse 
metallothionein-I gene. Although it was reported to direct 

20 ■ expression of human growth hormone-releasing hormone in 
renal proximal tubules, it also directed expression. in a 
number of other tissues, including pancreas, lung, heart and 
brain (Brar et al., 1989). Since an ^^ideal" promoter has 
not been found, both of these promoters will be used and the 

25 results compared. 

. Experimental design 

a) Construct a. plasmid fusing the KAP cDNA to a 
heterologous promoter that will drive KAP expression in the 

30 Jtldney. An expression plasmid routinely used in our 

laboratory will serve as a starting point for constructing 
the heterologous promoter/ECAP cDNA fusion. This plasmid 
utilizes the mouse metallothionein-I promoter to initiate 
transcription. The plasmid also has a bGH polyadenylation 

35 signal sequence that defines the site of poly A addition. • 
The 611 bp cDNA for BCAP, isolated from our subtraction 
libraries, will be inserted between the promoter and the 
polyadenylation signal sequence, generating the MT/KAP 
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fusion construct. Nucleotide probes complementary to the 
bGH sequence just upstream of the poly A addition site will 
be used to distinguish transgenic KAP mRNA expression from 
endogenous KAP mRNA expression. A protein tag will not be 
added to the protein sequence since, without a known 
function for KAP, it will be impossible to know if the tag 
interferes with function. The.plasmid also has a bacterial 
origin of replication and an ampicillin resistance gene for 
propagation in E. coli. Replacement of the MT promoter 
region with the 346 bp GGT promoter. (Sepulveda et al . , 1997) 
will generate the GGT/KAP fusion - construct * All manipulated 
regions of the expression plasmids will be sequenced to 
confirm that no errors were introduced. 

b) Confirm mRNA and protein expression from the fusion 
construct in cell culture. Before injection into mice, 
expression from each KAP fusion construct will' be confirmed 
in cell culture. Mouse L cells, used routinely in our 
laboratory, will be transiently transfected with the KAP 
fusion constructs using lipofectin. Transfected and 
nontransfected control cells will be harvested and either 
RNA or protein extracted. RNA will be analyzed by Northern . 
blot hybridization as described in the Preliminary Results 
section using a probe complementary to the bGH sequence at 
the 3' end of the KAP transgene message. Protein will be 
analyzed for KAP expression by Western blot analysis using 
monoclonal antibodies as described by Cebrian et al . , 2001 
(2001) . Since a protein tag will not be utilized for the 
transgene, protein expression paralleling the transgenic 
mRNA expression and differing from normal expression will be 
evaluated/ In the event that the GGT promoter is not 
functional in mouse L cells, a mouse proximal tubule cell 
line will be tried (Sepulveda et al., 1997). Our laboratory 
has considerable experience performing Northern and Western 
blot analysis. 

c) Create transgenic mouse lines carrying the KAP 
transgene. The pronuclear microinjection technique into 
single-cell fertilized embryos will be utilized to deliver 
the KAP transgene into the mouse genome. Our transgenic 
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mouse facility has been using the technique since the mid- 
1980s (McGrane et al., 1988). Since then^ our facility has 
optimized the technique and created hundreds of transgenic 
lines for Ohio University researchers as well as outside 
5 researchers. Pups obtained from the microinjections will be 
screened for the presence of the KAP transgene by DNA slot 
blot and Southern- blot hybridization. The number of 
integrated KAP transgene .copies will be quantitated by 
comparison to known copy .number standards. Transgenic lines 

10 will be established by crossing the transgenic founders to • 
nontransgenic mice and inheritance of the KAP transgene 
confirmed by DNA slot blot hybridization. These techniques 
are performed routinely in our laboratory. 

d) Confirm mRNA and protein expression from the fusion 

15 .construct in the KAP-transgenic mouse. lines. Fl progeny 
from the KAP transgenic lines will be assayed for KAP 
transgene expression. Transgenic mice and their 
nontransgenic littermates will be sacrificed by cervical 
dislocation and several organs (kidneys, pancreas, liwg/ 

20 heart and brain) immediately frozen. RNA or protein will be 
extracted from the tissues and then analyzed by Northern or 
Western blot analysis as described above. Tissues other 
than kidney will be included in order to determine the 
degree of tissue specificity. Lines expressing the highest 

25 levels of KAP from the transgene will be expanded for 
further analysis (see below) . 

Analysis 

The experiments proposed for Specific Aim 1 should 
30 result iri generation of two different transgenic mouse lines 
that express KAP in renal proximal tubules. However, 
several potential problems could be encountered. First, the 
selected promoters may not direct sufficient or appropriate 
expression. One way to increase expression from the MT 
• 35 promoter is to supplement the mouse drinking water with ZnS04 
(Eisen et al., 1998). Other promoters that could be 
considered include the mouse phosphoenolpyruvate 
carboxykinase (PEPCK) promoter or the cytomegalovirus (CMV) 
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promoter • Although the PEPCK promoter has been shown to 
direct expression in renal proximal tubules, as well as 
other tissues (Short et al., 1992), it was questionable as 
to whether induction of diabetes would influence its ability 
to direct transcription. Likewise, although CMV does direct 
expression in the kidney, it was not clear whether 
expression occurred in the proximal tubules. 

Another potential concern .is use- of the KAP cDNA for 
expression. ■ Sometimes we. find that inclusion of an intron • 
increases expression. Alternative constructs could include 
the 126 bp second intron of KAP or .the 3807 bp genomic clone 
(Niu et" al. , 1991) . 



Specific Aim 2: To induce diabetes in the KAP-transgenic 
mice and assess" the degree of kidney damage 

Rationale 

In order to determine if KAP mRNA expression (and 
subsequent protein production) can protect the kidneys of 
diabetic mice from damage, diabetes* will be induced in the 
KAP-transgenic mice generated under Specific Aim 1. After 4 
to 10 weeks of diabetes, diabetic and nondiabetic KAP- 
transgenic and nontransgenic mice will be sacrificed and 
their kidneys examined. If the parameters measured for the 
diabetic KAP-transgenic mice (glomerular volume, mesangial 
cell expansion, and urinary albumin excretion) are 
significantly different from diabetic nontransgenic mice and 
in fact more closely resemble those for nondiabetic, 
nontransgenic mice, then the hypothesis will be considered 
proven with evidence suggesting the use of KAP as a . 
therapeutic drug for the prevention of diabetic kidney 
damage . 

Experimental design 

Diabetes will be induced in twenty-five 2-month-old 
KAP-transgenic and nontransgenic littermate females by 
multiple injection of STZ as described in the Preliminary 
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Results section. Nondiabetic controls (twenty KAP- 
transgenlc and twenty nontransgenlc littermates) will be 
injected with citrate buffer only. After 4 or 10 weeks of 
hyperglycemia (blood glucose levels 300-500 mg/dX) , urine 
will be collected from diabetic KAP-transgenic mice and the 
three control groups (diabetic nontransgenic, nondiabetic 
KAP-transgenic and nondiabetic nontransgenic; 8-10 
mice/group). The mice will be sacrificed by cervical 
dislocation and tissues (kidneys, pancreas, lung, heart, 
brain, and liver) immediately removed. • The right kidney 
will-.be quickly frozen for later .RNA and protein extraction 
and the left kidney will be bisected and placed in 4% 
paraformaldehyde . RNA and protein will be examined by 
Northern and Western blot analyses to determine levels of 
transgenic and endogenous KAP expression as described above. 
Glomerular volume, mesangial cell expansion and "UAE will be 
measured as described in the Preliminary Results section. 
Statistically significant differences will be determined by 
ANOVA. 

Analysis 

Performance of Specific Aim 2 should establish whether 
expression of KAP in kidney using a heterologous promoter 
can protect the kidney from diabetic damage. If KAP 
expression protects the kidney, then the measurements of 
glomerular volume, mesangial cell expansion and UAE should 
not differ significantly between diabetic, KAP-transgenic 
mice and nondiabetic, nontransgenic mice. Instead, there 
should be a significant difference between diabetic, KAP- 
transgenic mice and diabetic, nontransgenic mice. 

One potential problem that might be encountered is an 
effect of diabetes on expression by the heterologous 
promoters. Northern and Western blot analys'es of several 
different tissues, including kidney, should help determine 
if this is a problem. Also, it is unlikely that both 
promoters will be affected. 

Another possible scenario is that expression of the 
KAP-transgene will affect the measured kidney damage 
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parameters (glomerular volvimer mesangial cell expansion and 
UAE) differently. For example, diabetic, KAP-transgenic 
mice may no longer exhibit mesangial cell expansion and 
increased UAE (parameters that typically change during the 
5 later stages of diabetes in diabetic, nontransgenic mice; 

see Preliminary Results section), but may exhibit glomerular 
hypertrophy. This differential effect may help pinpoint the 
role of KAP in the kidney. Interestingly, in contrast to 
our results, KAP mRNA was reported as being slightly 

10 increased (1.5 X) in remnant kidneys exhibiting glomerular 
hyperf iltration two weeks after undergoing subtotal renal 
ablation (5/6 nephrectomy) (Zhang et al . , 1999). A possible 
explanation is that KAP expression increases during the very 
early stages of diabetes (prior to our earliest timepoint, 

15 perhaps in an- effort to protect the kidney) but then 
decreases as diabetes progresses. 

Another potential problem is that the heterologous 
promoters may not be - able to drive sufficient expression of 
KAP for it to protect against diabetic damage. KAP appears * 

20 to be highly expressed in our female mice, as shown in the 
Preliminary Results section. We have not assayed KAP 
expression in our male mice, but reports in the literature 
• indicate that KAP ekpression is highest in males and lower 
in females and castrated males (Cebrian et al., 2001). 

25 Achievement of even higher levels of BCAP may require 

characterization of a similarly expressed gene promoter, 
such as that for the gamma phosphorylase kinase gamma- 
subunit (Takenaka et al . , 1998). 

The possibility also exists that increased KAP 

30 expression will not protect the kidney from damage, but that 
the decreased expression results from the development of 
kidney damage. If this appears to be the case, then 
decreased KAP expression could potentially be used as a 
diagnostic marker for kidney damage- 
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Future direction 

As demonstrated in the Preliminary Results section^ KAP 
mRNA expression also decreases with increasing kidney damage 
as a function of age in MT/bGH-transgenic mice. Since these 
5 mice do not become diabetic, their kidney damage presumably 
occurs via a different mechanism. Thus, they provide an 
independent model for the study of progressive glomerular 
hypertrophy and glomerulosclerosis. Although completion 
lies beyond the time frame of this proposal, we hope to 

•10 . initiate a study that combines the bGH transgene and the KAP. 
transgene in a double-transgenic mouse through genetic 
breeding. The double-transgenic mice, and single-transgenic 
controls, will, then be sacrificed at 2, !5, or 12 months of 
age and their kidneys examined for KAP transgenic and 

15 endogenous expression and .tor glomerular hypertrophy and 
mesangiar cell expansion as described for Specific Aim 2. 
Data demonstrating that KAP expression protects the kidney 
from damage due to the bGH transgene would further support 
and even broaden the role of KAP in protecting the kidney 

20 from damage. 

Adaptation to Human Therapy 

To adapt the gene therapy system for humans, we could 
use somatic cell gene therapy. Any vector that delivers DNA 

25 . into the nucleus (i.e. viral or non-viral methods) as well 
•as any promoter that could direct constitutive, high levels 
of expression could potentially be used. Since KAP may be a* 
secreted protein, it may not be important to direct 
expression only to kidney cells. Although we anticipate 

30 that ECAP would protect the kidneys, it could possibly 
protect other organs as well. KAP would preferably be 
administered/maintained as long as the patient is diabetic 
and susceptible to kidney damage. Administration route 
could be any of the following: subcutaneous, intravenous, 

35 intramuscular. 

KAP could potentially be used in humans as a 
^^conventional'' pharmaceutical, especially if it is a 
secreted protein and thus would normally be found 
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circulating throughout the body. The administration route 
could be any that would keep the protein intact, (e.g. 
subcutaneous, intravenous, intramuscular) . 
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Citation of documents herein Is not Intended as an 
admission that any of the documents cited herein Is 
pertinent prior art, or an admission that the cited 
documents Is considered material to the patentability of any 
of the claims of the present .application. All statements as 
to the date or representation as to the contents of these 
documents Is based on the Information available to the 
applicant and does not constitute any admission as to the 
correctness, of the dates or contents- of these documents . 

The appended claims are .to- be treated as a non-llmiting 
recitation of preferred .embodiments . ■ 

In addition to those, set forth elsewhere r the following 
references are hereby incorporated by reference^ In their 
most recent editions as of the time of filing of this 
application: Kay^ Phage Display of Peptides and Proteins: A 
Laboratory Manual/ the John Wiley, and Sons Current Protocols 
series including Ausubelr Current Protocols In Molecular 
Biology/ Coligan^ Current Protocols in Protein Science/ 
Collganr Current Protocols in Immunology/ Current Protocols 
In Hvman Genetics/ Current Protocols in Cytometry/ Current 
Protocols in Pharmacology/ Current Protocols in 
' Neuroscience/ Current Protocols in Cell Biology/ Current 
Protocols in Toxicology/ Current Protocols in Field 
Analytical Chemistry/ Current Protocols in Nucleic Acid 
Chemistry/ and Current Protocols in Human Genetics/ and 
the following Cold Spring Harbor Laboratory publications: 
Sambrook^ Molecular Cloning: A Laboratory Manual/ Harlow^ 
Antibodies : A Laboratory Manual/ Manipulating the Mouse 
Embryo: A Laboratory Manual/ Methods in Yeast Genetics: A 
Cold Spring Harbor Laboratory Course Manual/ Drosophlla 
Protocols/ Imaging Neurons: A Laboratory Manual/ Early 
Development of Xenopus laevls: A Laboratory Manual/ Using 
Antibodies : A Laboratory Manual/ At the Bench: A Laboratory 
Navigator/ Cells: A Laboratory Manual/ Methods in Yeast 
Genetics: A Laboratory Course Manual/ Discovering Neurons: 
The Experimental Basis of Neurosclence/ Genome Analysis : A 
Laboratory Manual Series / Laboratory DNA Science/ 
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Strategies for Protein Purification and Characterization: A 
Laboratory Course Manual/ Genetic Analysis of Pathogenic 
Bacteria: A Laboratory Manual/ PCR Primer: A Laboratory 
Manual/ Methods in Plant Molecular Biology: A Laboratory 
Course Manual / Manipulating the Mouse Embryo: A Laboratory 
Manual/ Molecular Probes of the Nervous System/ Experiments 
with Fission Yeast: A Laboratory Course Manual/ A Short 
Course in Bacterial Genetics: A Laboratory Manual and 
Handbook for Escherichia coli and. Related Bacteria; DNA 
Science: A- First Course in . Recombinant DNA Technology/ 
Methods in Yeast Genetics: A Laboratory Course Manual/ 
Molecular Biology of Plants: A Laboratory Course Manuals 

. All references cited herein^ including journal articles 
or abstracts^ published^ corresponding r prior or otherwise 
related U.S. or foreign patent applications^ -issued U.S. or 
foreign patents r or any other references ^ are entirely 
incorporated by reference herein/ including' all data, 
tables / figures r and text presented in the cited references. 
Additionally f the entire contents of the references cited 
within the references cited herein are also entirely 
incorporated by reference. 

Reference to known method steps, con\rentional methods • 
steps r known methods or conventional methods is not in any 
way an admission that any aspect^ description or embodiment 
of the present invention is disclosed^ taught or suggested 
in the relevant art. 

The foregoing description of the specific embodiments 
will so fully reveal the general nature of the invention 
that others can, by applying knowledge within the skill of 
the art (including the contents of the references cited 
herein) , readily modify and/or adapt for various 
applications such specific embodimentSf without undue 
experimentation , without departing from the general concept 
of the present invention. Therefore/ such adaptations and 
modifications are intended to be within the meaning and 
range of equivalents of the disclosed embodiments^ based on 
the teaching and guidance presented herein. It is to be 
understood that the phraseology or terminology herein is for 
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the purpose of description and not of limitationr such that 
the terminology or phraseology of the present specification 
is to be interpreted by the skilled artisan in light of the 
teachings and guidance presented herein^ in combination with 
the knowledge of one of ordinary skill in the art. 

Any description of a class or range as being useful or 
preferred in the practice of the invention shall be deemed a 
description of any subclass (e.g.f a disclosed class with 
one. or more disclosed members omitted) or subrange contained 
therein r as well as a separate description of each 
individual member or value in said class or range. 

The description .of preferred embodiments individually 
.shall be deemed a description of any possible combination of 
such preferred embodiments^ except for combinations which 
are impossible fe.g, mutually exclusive choices for an 
element of the invention) or which are expressly excluded by 
this specification. 

If an embodiment of this invention is disclosed in the 
prior art, the description of the invention shall be deemed 
to include the invention as herein disclosed with such 
embodiment excised.' 
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Table 1: Sequence of Clone G26, related to hydroxysteroid 
dehydrogenase-4 

lA: beginning of coding strand (SEQ ID NO:l) 

5 

actcagctcctacnggntgcttgtgtggaagccaacgtgccaacattcatctacagcagc 
tcagtgcttgtggctggaccaaattcctacaaggagatcatcctgaatggccatgaggag 
gagcatcatgaaagcacatggtctaacccatacccatgnnnngtgaaagatggctgagaa 
ggcagtgctggcaacccntgggagcatcctgaaaaatggtggcactttgcatacttgtgc 
. 10 cttaagactctctttcatctatggggaagaatgccaagtcacttcaaccactgtgaaaac 
- * agcactgaagaacaacagcataattaagaaagatgccacattctccattgccaacccagt 
gtatgtgggcaatgcancctgggctcacattctggctgccaggagcctacaggaccctaa 
aaagtccccaagcatccaaggacagttctattacattaacagatgacacccctcaccaaa 
gcnatgatgacttnaaaatgcaccctgagcaaggagtgggggcccncccccttganacca 
15 actgggaagtctttcctctgccccctgctntactnggcnttgccttncctccccnggaaa 
actggggaaccttcctgctaccnnccannctaccaacttannnaaccccnttttaaaccc 
cccnttggaccnccanngnnaaaaaaangnnntncncccnntcctaccaaaaaggcncaa 

• cnnnhaacnnggcnttnancnnnttng 

20* IB: end of coding strand, obtained by sequencing opposite 
strand, deducing complement, and arranging in 5 '-3' order 
(SEQ ID NO: 2) 

ngggcttncggcccnnananntcccaagcnnccaggnnggtttnttcctttcnangnncc 
25 cctcaccaaaagtatgntnattt^aaagacccttgancaaagantggggccttngctttg 
atacanntggagtntttctntggcccnnctncnatnggcntgctttcntcttnaaactnt 
gagnntccnggntacgtccagtnttncaaaataaaancccgcccatttaaccgcctcttg 
atcacagtgnttaaanagtgatccacctctcctanaagaaggtcagngagatttgggcta 
tgagccaacttgtcagctgggaggnagccaagcagaaaacctcagagtggatcggaaaca 
30 ctagtgatgcagcacagggagattggaaacaaaaagtcncagtgatatgaagagggagag 
gacatggccctgggtgttattaggtcctccagaaagggacttagaaacaatccaactctt 
aacaattccattttacactctgtccaacttgtctttgggtcaccagaagccttgcaagtc 
actggcccagttgcaacccttcagctntaagcacttgtccagcaatgcacaagatgtgcc 
tcagctgcngtgacccaaggatgggtggctgatagtgagttgcctggaacctcttgtagg 
35 ttagaatttcatcnnggcttccacatctcctttccatgtgccaacgcatttcgtgtctga 
gaaaattcccatacctttatgaagctcaaagaatagaacacataaaatcttttaatgcgt 

aaaaaaaatnaaaaaaaaa 



40 

Table 2: Sequence of Clone H8 (SEQ ID lSIO:3), related to 
vacuolar adenosine triphosphatase subunit D; 

acttcaatgtggaccatggctacctggagggcctggttcgaggatgcaaagccagcctcc 
45 taactcagcaggactatgtcaacctagtgcagtgtgagacctcggaagacctgaaaattc 
atctccagaccacggactatggcaacttcctggctaatgaaacaaatcctctcactgttt 
ccaaaattgacacggagatgaggaagaagttctgcagagagtttgactatttccggaatc 
attccttggagtccctgagcacatttctcacctacatgacatgcagctatatgatagaca 
atataattctacttatgaatggggccttgcaaaagaaatctgtgaaagaagttctagcca 
50 agtgtcacccactgggccgtttcacagagatggaagctgtcaacattgcagagaccccct 
cagatctcttcaggcttgtgctggttgaaacaccattagctccattctttcaagattgta 
tgtcttgaaaacacctcttgatgaacttgaatattgaattactgcgcaataaactataca 
agtcttaccttgaggcattctacaaattctgcaaggatcacggtgatgtcacagcagacg 
ttatgtgtcccattcttgagtttgaggccgacagacgcgctttaatcatcactctgaact 
55 catttggcactgaactaagcaaagaagacagggagaccctcttccccacctgcggcaggc 
' tctatccagaggggttgcggttgttagctcaagctgaagactttgagcagatgaagagag 
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tggcagataattatggagtttacaagcctttgtttgactgctgtcggtggcagtgggggg 
aagacactggaagacgttttctatgagagagaggt 

Table 3: Sequence of Clone HI (SEQ ID NO:4), related to 
5 ubiquitin protein ligase Nedd-4 

acatacgtgtttacggagttcattatgtttacagattaagcgaatttctgtagttgcatt 
tttatatttttagtatcacattagtataaaaatttgtttaaaatagccaaagagtagttc 
aatgcataatttgtatgaatttgttaccaagtttcttatgctttctaaaaaatactgttt 

10 cctatgaaaattatgtttaatcaaaagtcaagaacccttggggcaaaatgctacaggtgg 
agtcccaccatagtcacctttgaggcacaggaagaggttccatgcatgaatctgcacaca 
tgagcagtgctggctgctgtgcaccgggcccttgggatgaagccatcagcactgacatcc 
agccccgtcttcatcctcaggcttagaaacaagtgggatggtgactttacagtcagtggg 
aagtaagagcccgcagaataacaaaggcagaaggacaggaaagcgaggtccgagtccgac 
. .15 tgtgaggtcatatctggccaaggctccttctagtcagcttacactggtgattatacggaa 
aatgctcttagatatgagaaaagcaatgtggcatgtaatgagttagtgtaagcttgtgtt 
ataaactttcc.gaatgcttataagtagcctgtgggagctgtaagaaatacttagggccat 
. . tgatttaagcttgttggctctaacatgactaatacctgtcagcccatggtagggatttca 
gtgacagtagtggaaacgctttccacagtccacaccactatgttgatcggtgaaagacac 

20 ccccaggccagtgcatgcttccagagtgtcctgatcagtcagccaggcatggGcaccgtc 
cagtgttgatttttatcattagccagctgagcctgcagcacggggctcatttcccogtct . 
ggcagatgtggatagctgggcccactcattctgtctctcaaggctatggtaagttcttaa 
-aggggt 

25 • Table 4: Sequence of Clone Fl (SEQ ID NO:5)|. related to 
truncated SON protein.. 

acaagcccgacttgaaagaggcct'ccagaaaaagtagatgcgtatctgtgcaaacagatc 
ctactgatgaagtgcccaccaaaaagtcaaagaagcataaaaagcacaaaaacaaaaaga 

30 agaaaaagaagaaagaaaaagaaaaaaaatataaaagacaaccagaagaatctgagtcca 
agctgaaatctcatcatgatgggaatctagaatctgattccttcttaaagtttgattctg 
aaccttcagcagcggcact-ggaacatcctgtaaga'gcgtttggcttatctgaggccagtg 
agaccgccttagtgctggaacctccagtagtctcaatggaggttcaggagtcacacgttc 
tagagactctgaagccagctacaaaagctgcagaactgtcagttgtgtctacatcagtaa 

35 tctcagagcagtctgagcagccgatgccaggtatgctggaaccatcaatgacaaagatcc 
.tggattccttcacagcagcaccagtggcctatgtcaacagcggcgctgaagtcacctgag 
ccccgttgtaacaaatgtcagtggaatatcagaagtctgtgctgaaatctttggagacta 
tgcctccagagacgtcaaagaccacgctggtagagcttcccatagcaaaagtggttgagc 
catcagaaaccctcacgatagtgtcagagacacctactgaggtgcaccctgaaccaagcc 

40 catcaacaatggattttccagagtcatctacaactgacgtgcaaagattgcQagagcagc 
ctgtagaagcaccatcggagattgcagattcatccatgacaa^acctcaggagtcactgg 
agctgcctaagaccacagcggtggagctgcaggagtcaacggtggcctcagctctggagt 
tgccggggccacctgcgacctccattctggagttgcaggggccccctgtgactccagtgc 
cagagttgcctgggccctctgccaccccagtgccagagttgtcagggcccctttctaccc 

45 cagtgcctgagttgccagggccccctgcgacagtggtccctgagttgccggggccctctg 
tgacaccagtgccacagttgtcgcaggaattgccagggcctccagcaccatccatggggt 
tggagccaccacaggaggt 

50 Table 5: Sequence of Clone F4 (SEQ ID NO: 6)^ related to 
kidney androgen regulated protein 

acgcggggtaatttctttctcctgttcactggttgccttaaccctactaaagcatgatgc 
ttttcaaggtcctggtgatcactgtctcctgtggtctgactgtggctttccccctgtcag 
55 aattagtttcaatcaataaagaactacagaattcaatcattgacctactaaactcagtct 
ttgaccaactgggatcatacagagggacaaaagctcctctagaggattatacagatgatg 
atttaagcactgactctgagcagatcatggacttcacgccagccgcaaacaaacagaatt 
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ctgagttctctactgatgttgagacagtctcctccggctttctggaagaattcactgaga 
acacagacatcacagtgaaaattccattagctgggaatccagtctcccctacttcctgag 

aagactccaactttgaaataaacctgcgactgccattcaggatattgtctttgtagaaaa 
tgatataaatctaagatgctcccacaaagaagaaagaacgaggaagtgatcctggatgag 
aaatgctgtgccaggcatctctaagctctacccttactntgtgtgggtttaagaaataaa 

ccgttcatttgattgc 



Table 6: Sequence of Clone F5 (SEQ ID N0:7), related to 
claudin-10 

acgcggggnagtagggaattagaaagattattctaagctatctctaagatgcccttagtg 
tgatatactttattctaatttcccacacttcaagccatgagattatgaagaattgtcacc 
.tac.cctcatgggctggctttaacattcgtggaagttttgtccagtgtctgcatgcctagg 
•ccaccaaagccttctgtgtggactgtggcagcaggcaaggctgagcgacatgtccagggc 
acagatctcagctctggtgtgtggtgttggagggtttggtgctctcgtcgctgccaccac 
atccaacgaatggaaagtgaccacccgagcgtcgtctgtgattaccgccacctgggttta 
ccagggtctgtggatgaactgcgcaggtaacgctctgggctccttccactgccggccaca 
tttcactatcttcaaagtagaaggttacatccaggcatgtagaggactaatgatcgctgc 
ggtcagcctgggatttttcggttccatttttgcactctttggaatgaaatgtaccaaagt 
cggaggctcagatcaagccaaagctaaaatccgcttgctttggcccgggattggattcat 
attgncaggtctggggttccatgacaggctgttcccttgtntgcaaccaaatcncaacng 
aattctttgatcctntttatatggagcaaaagtttgaattaggggcttgtttttttatcg 
gatgggcngaacttttttntgntcatgggggagnatatttggttttaatatccacacacc 
aacnccaaaagggtacnntcaanggncccnttngcatgtttttggaccagtttaggngna 
aagaaaaaaaaaaaaaaaaaancctngccnancctagggan 



Table 7: Sequence of Clone F6 (SEQ ID NO: 8), related to 
kidney androgen regulated protein 

aacttctttctcctgttcactggttgccttaaccctactaaagcatgatgcttttcaagg 
tcctggtgatcactgtctcctgtggtctgactgtggctttccccctgtcagaattagttt 
caatcaataaagaactacagaattcaatcatcgacctactaaactcagtctttgaccaac 
tgggatcatacagagggacaaaagctcctctagaggattatacagatgatgatttaagca 
ctgactctgagcagatcatggacttcacgccagccgcaaacaaacagaattctgagttct 
ctactgatgttgagacagtctcctccggctttctggaagaattcactgagaacacagaca 
tcacagtgaaaattccattagctgggaatccagtctcccctacttcctgagaagactcca 
actttgaaataaacctgcgactgccattcaggatattgtctttgtagaaaatgatataaa 
tctaagatgctcccagcaaagaagaaagaacgaggaagtgatcctggatgagaaatgctg 
tgccaggcatctctaagctctacccttactctggggtgggtttaagaaataaacgttcat 
tttgaattccc 



Table 8: Sequence of Clone A8 (SEQ ID NO: 9) related to an 
immunoglobulin sequence 

acgcgggtgagcagcaccctcacattgaccaaggacgagtatgaacgacataacagctat 
acctgtgaggccactcacaagacatcaacttcacccatcgtcaagagcttcaacaggaat 
gagtgttagagccaaaggtcctgagacgccaccaccagctccccagctccatcctatctt 
cccttctaaggtcttggaggcttccccacaagcgacctaccactgttgcggtgctccaaa 
cctcctccccacctccttctcctcctccctttacttggcttttatcatgctaatatttgc 
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agaaaatattcaataaagtgagtctttgcacttg 



5 

Table 9: Sequence of Clone A34 (SEQ ID NO:10), related to an • 
immunoglobulin sequence 

• acgttgtaaaacgacggccagtgaattgtaatacgactcacttatagggcgaattgggcc 
10 ctctagatgcatgctcgagcggccgccagtgtgatggatatctgcagaattcgccctttc 
. gagcggccgcccgggcaggtacagtgtgggtttaccagtggacttgtccacggtcctctc 
ggtcaccaggtgtggcagggcctcgtggcctacaacacaggtataggtctctccggagtt 
ccattcctcctctgtcacagtcaggatgctgtgggtaaagtagaagcctggggccccagg 
ctctggcatcggggcactggtcacatacttctcttggggcaagagtitgccctccctgaag 
15 • ccactgcacactgatgtctgcaggagagaagcccttcaccaggcaggtgactgtggctga 
ctccctcaggttcagttgctcacgagctggtggcagcaggtacctcggccgcgaccacgc 
.taagggcgaattccagcacactg 



•20 . 

Table 10: Sequence of Clone A39 (SEQ ID NOrll)/ related to 
an immunoglobulin sequence 

acaagatggggactgggacgggcctttgactactggggccaaggcaccactgtcacagtcr * 
25 . tcctcagccaaaacaacacccccatcagtctatccactggcccctgggtgtggagataca * 
actggttcctctgtgactctgggatgcctggtcaagggctacttccctgagtcagtgact : 
gtgacttggaactctggatccctgtccakcagtgtgcacaccttcccagctctcctgcag 
tctggactctacactatgagcagctcagtgactgtcccctccagcacctggccaagtcag 
accgtcacctgcagcgttgctcacccagccagcagcaccacggtggacaaaaaacttgag 
30 cccagcgggcccatttcaacaatcaacccctgccctccatgcaaggagtgtcacaaatgc 
ccagctcctaacctcgagggtggaccatccgtctttatcttccctccsiaatatcaaggat 

gt 



35 



Table 11: Sequence of Clone A48 (SEQ ID N0:12),. related to 
. an immunoglobulin sequence 

40 

. acaccattccacctcccaaggagcagatggccaaggataaagtcagtccgacctgcatga 
.taacaaacttcttccctgaagacattactgtggagtggtagtggaatgggcagccagcgg 
agaactacaagaacactcagcccgtcacggacacagatggctcttacttcgtctacagca 
agctcaatgtgcagaagagcaactgggaggcaggaaatactttcacctgctctgtgttac 
45 atgagggcctgcacaaccaccatactgagaagagcctctcccactctcctggtaaatgat 
cccagtgtccttggagccctctggtcctacaggactctgacacctacctccacccctccc 
tgtgtaaataaagcacccagcactgccttgggaccctgcaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaagt 

50 



Table 12: Sequence of Clone B45 (SEQ ID N0:13)/ related to 
kidney androgen related protein 

55 

acgcggggaatttctttctcctgttcactggttgccttaaccctactaaagcatgatgct 
tttcaaggtcctggtgatcactgtcttctgtggtctgactgtggctttccccctgtcaga 
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attagtttcaatcaataaagaactacagaattcaatcattgacctactaaactcagtctt 
tgaccaactgggatcatacagagggacaaaagctcctctagaggattatacagatgatga 
tttaagcactgactctgagcagatcatggacttcacgccagccgcaaacaaacagaattc 
tgagttctctactgatgttgagacagtctcctccggctttctggaagaattcactgagaa 
5 cacagacatcacagtgaaaattccattagctgggaatccagtctcccctacttcctgaga 
agactccaactttgaaataaacctgcgactgccattcaggatattgtctttgtagaaaat 
gatataaatttaagatgctcccagcaaagaagaaagaacgaggaagtgatcctggatgag 
aaatgctgtgccaggcatctttaagctctgcccttactctgtgtgggtttaagaaataaa 

cgttcatttg 

10 



tTajbJe 13:' Sequence of Clone C22 (SEQ ID N0:14), related to 
disabled -2 p96 {Dab2) 
15 . ■ . 

actgtgtaggtttccaaggctatgaggaatatatttcaggcagatttttcaatgataacc 
ccaaatatttgtccatgattgtggcttcaagaanttttccagttacttgataaaggccta 
gtcatccagtaaggacccagtgtgtctagcatagtcacatttagctttcagaatacagtc . ^ 
cttcactgctgagcagaatccagttcccctactgtgttaaattgctctatgaaacttcag 
20 tctaagtcccccaagccatagggagaatgaagaaggtgcaccttccagaagtctagcccc 
tggactaacaatgactggagtgggtgcgtgcacttcagatctatgaaacatttaatgctt 
tttgccccgtgaaaaatccattcctggtaaaagggaggcactgtttatataaacccagga 
— gcaaaatgaactg 

25 

Table 14: Sequence of Clone E39^ related to an 
immunoglobulin sequence 

30 14A: beginning of coding strand (SEQ ID NO: 15) 

acgcgggggagccacacaaactcagggaaagctcgaatatggttttcaaacctcagatac 
ttggacttatgctttttaggatttcagcctccagaggtgatattggctacctcaatctcc 
agccaccctgtctgtgcctccaggagatagcgtcagtctttccttcagggccagccaaaa 
tattagcaacaacctacactgggttcaaaaaaaccacatgagtctccaaggcttctcatt 

35 aaagtcacacctggcctcatcacgttcggagctgggaccaagctggagctgaaacggact 
gatgctgcaccaactgtatccaatcttcccaccatccagtgaacagttaacatctggagg 
tgcctcagtcgtgtgcttcttgaacaacttctaccccaaagacjatcaantgtcaaagtgg 
gaagwtttgatgggcagatgaaaccgaacaaaaatggcgtccctggaacagtttgggaac 
tgatcagggacaggcaaagaacagcacctacagcatggaagcagcaacccttcaacgttt 

40 . gaccaaggaccgaagttattgaaaccgaccataacagctt 

I4B:end of coding strand, obtained by sequencing opposite 
strand^ deducing complement, and arranging in 5 '-3' order 
(SEQ ID NO: 16) 

45 nngnnccctcacgttgaccaagggnngagtatgaaggacataacagctatacctgtgagg 
ccactcgcaaggcatcaacttcacccattgtcaagaggttcaacaggaatgagtgttaga 
gacaaaggtcctgagacgccaccaccagctccccagccccatcctattttcccttttaag 
gtcttggaggttccccccaagcgacctaccactgttggggtgctccaaacctcctcccca 
cctccttttcctcctcctccctttccttggcttttatcatgctaatatttgcagaaaata 

50 ttccaataaagtgagtctttgcccttaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 



55 TaJble 15: Sequence of clone F27 (SEQ ID WO; 1 7, related to a 
105 kDa heat shock protein 
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acaaattggatgcaaaatccaaaattcgagccctccttcgtctccatcaggagtgtgaaa 
agttgaaaaagctcatgagttctaacagcacggacctgccgctgaacatcgagtgcttta 
tgaatgacaaggatgtctctgggangatgaacaggtcacagtttgaagaactgtgtgctg 
agctcctgcaaaaaatagaggtcccccttcactcgttgatggcacagactcagctcaagg 
5 ctgaanatgtgagtgccattganatagtgggaggtgcccaagaatcccanctgtnaaaga 
aagaattgccaanttctttggaaaagatgtnancacccacnctnaatgntgacnaagctg . 
taggccngnaggctgangcactgnctngtncaatnctttctccggcattttaaagttaca 
gantcntnttttcncnccatgctacccnttntccnaatatcanctggtcttggaacccnn 
tgnctcnnnangaaaacnttaacnntgttcnccnaccttntntcatnncnggaaccntgn 
10 ncgntnnttttnttccanacgtgtcnnnncctttnctnngcanaggggnggccctttngt 
nctacnaacnnttttcttottnaccncncnanggnantncnngtnnntaaancaanacna 
nnnnnccnnatannactgccnnaacntttntaccntcnanccgntnnttttcccnnncnc 
tanngtnanancctcnaannnncntngtntnttcatcntncnncnccntnncttgcnnnn 
angnccnnncacnnccannnnactnctcttcctannctcn 

15 

Table 16: Sequence of clone G38 (SEQ ID NO: 18, related to an 
immunoglobulin protein 

20 

acgcggggnattgtcattgcagtcaggnactcagcatggnacatgagggctcctgcacag 
natttttggcttcttgttgctcttg.tttccaggtaccagatgtgacatccagatgaccca 
gtctccatcctccttatctgcctctctgggagaaagagtcagtctcacttgtcgggcaag 

25 tcaggatattggtagtagcttaaactggcttcagcaggadccagatggaactattaaacg 
.cctgatctacgccacatccagtttagattctggtgtccccaagaggttcagtggcagtag 
gtctgggtcagattattctctcaccatcagcagccttgagtctgaagattttgtagacta 
ttactgtctacaatatgctacttctcctccgacgttcggtgggggcaccaagctggaaat 
caaacgggctgatgctgcaccaactgtatccatcttcccaccatccagtgagcagttaac 

30 atctggaggtgcctcagtcgtgtgcttcttgaacaacttctaccccaaagacatcaatgt 
caagtggaagattgatggcagtgaacgacaaaatgggcgtcctgaacagttggactgatc 
aggacagcaaaanacngcacctacagcatgancagcacccttacgttgaccaaggacnag 
tttgaacgacntancnctttacctgngaggccctnacaaaaatnacttncccattgnnag 
acttcacaggaatgagtgtanaaacaaggnctgnacnccccccnntnccanttcntctat 

35 ttcctttaaggttggaggttccccannacnccnttntgggggctcaaactntnccnctct 
tt 



Table 17 : Sequence of clone B3 

40 

17A: beginning of coding strand (SEQ ID NO: 19) 
acttgaactggccttcttatccaggataagtggagttcatgcaaggtggatccctcatga 
taggattagtggctttataagaacagtgattgagaaagaattcattctgttgacgcccat 
gcaagcggaacagcaaactgcttaagaactcatggaatgtggcccactggttagcaccat 
• 45 ctcaatgggtgcgccttggagaggcactacatgtcctgaggatcacatgctgctgtagag 
cgtcactctgcttgttacccttgtggtggtcatcatgtctccccctaatctatgagctng 
tatgaaaactctaagagggtttccagaccctcacagggcagtgtcattggcatttacaat 
• agttgttcttgatctttctcaagcattactgctggggttagattaatgaattcagtttac 
cattcaantaaagttagggttaaggtatttttgctaagtnggtttntgatcgtaccaaaa 
50 tcttggaggagcaatggtcaaagtttttagaaaggccntttangaaaagttttgataacc 
cnantggaaaggattnttttggangntcnagggnaancaaagnanttaaccaccaagttn 
cnt 

17B: end of coding strand, obtained by sequencing opposite 
55 strand, deducing complement, and arranging in 5 '-3' order 
(SEQ ID NO: 20) 

ctttttaagataaattgaaacctgccagtaagacaacctggggagaaaattgccttgtta 
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agttataacaagttctttggttatgaatgcatgtgggttcttgggaataatttgttttta 
acccataatatgtaaaacatttgatcatgtgaagctattgaggagcatatgtatctctct 
tttttaaaaatgtattctcattgtaattattcacttaaagaaaagtagaatgtaaccaca 
tcgtggttttatgctgcttgaaagaatttcctggtagatatataagctatggaagaaaaa 
5 taaaaattttggagcttgcatcatccgaaaaaaaaaaaaaaaaaaaaaaaaaagt 

Table 18: Sequence of clone F39 

18A beginning of coding strand (SEQ ID NO: 21) : 
10 acgcgggggagccacacaaactcagggaaagctcgaatatggttttcaaacctcagatac 
ttggacttatgctttttaggatttcagcctccagaggtgatattggctacctcaatctcc 
agccaccctgtctgtgcctccaggagatagcgtcagtctttccttcagggccagccaaaa 
** tattagcaacaacctacactgggttcaaaaaaaccacatgagtctccaaggcttctcatt 
aaagtcacacctggcctcatcacgttcggagctgggaccaagctggagctgaaacggact 
15 gatgctgcaccaactgtatccaatcttcccaccatccagtgaacagttaacatctggagg 
tgcctcagtcgtgtgcttcttgaacaacttctaccccaaagacatcaantgtcaaagtgg 
gaagwtttgatgggcagatgaaaccgaacaaaaatggcgtccctggaacagtttgggaac 
tgatcagggacaggcaaagaacagcacctacagcatggaagcagcaacacttcaacgttt • • 
gaccaaggaccgaagttattgaaaccgaccataacagGtt 

20 

IBB END OF CODING STRAND|. obtained by sequencing opposite strand, 
deducing complement, and- arranging in 5 '-3' order (SEQ ID 
.NO:22) : - ■■ 

nngnnccctcacgttgaccaagggnngagtatgaaggacataacagctatacctgtgagg 
25 .ccactcgcaaggcatcaacttcacccattgtcaagaggttcaacaggaatga^tgttaga • 
gacaaaggtcctgagacgccaccaccagctccccagccccatcctattttcccttttaag 
gtcttggaggttccccccaagcgacctaccactgttggggtgctccaaacctcctcccca 
cctccttttcctcctcctcccfcttccttggcttttatcatgctaatatttgcagaaaata 
ttccaataaagtgagtctttgcccttaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

30 

Table 19: Sequence of F2 (SEQ ID NO: 23) 

acctccccaaaatgactcttttggagcacagttacctccaatgcatcagcagcaaagcag 
atctgtaatgacagaagaatacaaagttccagatggaatggttggatttataattggcag 
aggaggtnaacagatctcacngngngcagcaggaatctggatgcangatacanatagcnc 

35 . ctgatagtggtggcctaccanaaaggtctngtatgctaactggaacacctgaatctgtcc 
agtcagcaaaaagattattggaccagattgttgaaaagggaagaccagcccctggccttc 
atcatggngatggacctggaaatgcagttcaggaaatcatgattccanccagcnaagcna 
ggactagnttattcggaaaggggggccgagactattaatcnnctttcncgaacggcctgg 
gtgttaaaatggttatgattnaagatnggcctcaaaaaccntgggggtgatnaacctctt 

40 agggattnggggttnnccatacaaagttnagcaagccaagcnnnangtattaggagttaa 
ttngtgaanaaggtggtttcagagaagtgcggaatgagtatggctcaanaataggaggca 
atgaagggatagatgtcccaatttccaagatttgctgttggcattgtaataggaagttnt 
ttnaaaaanaacnnccaaaaaaaaaaa 

• 45 

Table 20: Sequence of F21 

20A beginning of coding strand (SEQ ID NO: 24) : 

ttnngnantacngngcncaacccnttttaggaacggnccccantgtgctggaattcgccc 

ttagcttggtcgcggccgaggnnctacggctcaaaaccccgtcaccacggcccggatcgg 

50 gaagtgtcaagtgggagccgggttcccccgcatttgcagccgccacanctgaacccggac 
ccctcaccccagccgggcnctaccgccntttttaacntgtnctnnngngggngggatcct 
ggagttngntttgataaaaaaanccccccctgtgccngngggnagnaccnncnntntncc 
tccaaaccnngnctgctnccaataaaananaggncntcgtggnnttntatacatacaaaa 
ncnccctnatnatcngnaacnacnaccnnnntncctnnctngttcnantancccncancn 

55 gacaaatcnccctntncacatncccccnccnaacnnnctacaantcnacaaacccctnnc 
nacccccattnatcttagcccncccntaggncnancntacaccnanccaatcnngcacan 
cnaanacccngccnccccctccnctacnaacctccnnacctntcatacnttccnnccgtc 
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ncccnntnatcaacnnctccnntacnncnctcccccccccnannngttcccccaaancnn 
nccactaancacccgnacctttnnnncccntnaanagnccatcccctaantncccacccc 
ctttcnctcnntnactntntnnnnncccccncacacancantntacnnntnttctntnca 
• ccacaaccatttccccnantttnacntccaatctcantcctacccccnncnacacccnca 
5 cctcacttcaaccatanaccnnncncnttcgtccccancncaccncannncccctnncnc 
acncactcttcnnntcnntncnnnttcacccantaccccntcgcaacgntcccncnantn 

ccannntcccncacn 

10 20B end of coding strand, obtained .by sequencing opposite 
strand, deducing complement, and arranging in 5 '-3* 
order(SEQ ID NO: 25) : 

' gaagngnaaaattttttttaaaaaaaaaaaaaaaaaaaaaaa . 

15 Table 21: Sequence of F38 . 

.21A beginning of coding strand (SEQ ID NO: 26) : 
gntctgtgaaanganatagannggctcgaaggaccaaggcancttcctgactgaagacca 
gatcctgccgctgttgctgggtatcagcagaggccttgaggctattcatgccaaaggtta 
tgcacacagggacctgaagcccgtngnnnattttgcttggtgatgaggggcanccaantn 

20 taatggacatgggttntnttaatcanttcatgcattcaagtggagggctctcgccaggca 
ctaactcttcaggactgggcagctcagcggtgcaccatctcctaccgggcacctgaactt 

ttttctgtgcaaagcctn 

25 21B end of coding strand, obtained by sequencing opposite 
strand, deducing complement, and arranging in 5 '-3' 
order(SEQ ID NO:27): 

ctagnggcctttgtttttnntnaggtttcccggggggtggcnccctttttntttntnggg 
ccccccttttttgnantggntntttttttncnncgccctctgggggggnntcncgcggcg 
30 cntttttttgttgngggggntttgggggngncnntttcttcgtgggnccgntnttttttt 
tntncnaaaccccnngnttcnttnttgggggctntttgtgnnncgnngggggnngntttt 
tcccttntttttngggggnggccccncnntattgnngtttnccaanngggggcganngng 
. cccntcttgggccaaattnagngngcntcccccncannccccccggctttttttancntt 
gtgnanntttttntttttttnttgtgggggggggccccccgcgggggnctncntnccctn 
35 tncnncncngtnaggntgggggnntttccccccccngnttnnggccgnncccccccccnn 
tttnnntnnannnngngggcnttttggggnangncccntnnaggggntttttncccccnt 
tgggncncnnnccntttttnccgggnggntttnccngntnnnggcanggnnagggggncc 
ntnttttttttccctttttccccccantcnngggnnngggccctngggggggnngggggg 
aantttanccnnnantttgggncnaanntngggnngggggggcccccnncnttttntttc 
' . 40 ccntntngggggnnaaaaaaancntnaaaaggnnaatnangttaaacccncccaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
- aaaaaaaaaaaacaaaaaaa 

Table 22: Sequence of F40 (SEQ ID NO: 28) 

45 acgcggggtttccggcttgaggatccgtcgcgtttctgaggcggcgcctcagtcggcagt 
taggatcggctcggagcggtgggtctcggccccggcgtcaccatgggggccgtcctcggc 
gtcttctccctcgccagctgggtcccgtgcctctgtagtggtgcatcatgtctgctgtgc 
agttgctgtcccatcagtaagaattccactgtaactcgggtcatctacgcttttatcctc 
ttccttggcactattgtgtcttgcatcatgatgacagaaggcatacaaactcaactgaag 

50 aagattcctggattctgtgaaggaggatttcaaatcaagatggttgatacaaaggcagag 
aaagattgtgacgtgctggtcggttttaaagctgtgtatcggatcaactttgctgtggcc 
atttttttctttgccttctttttgctcatgttaaaagttaaaacaagtaaagatcccaga 
gcagcagtgcacaacgggttttgggttcttcaaaatcgctgccattatggtatcatgatt 
ggatctttctacatccctggggggagttttactgaagtctggttttttgctggaatgttg 

55 ggggcctctttcttcattatcatccagctggtgctcttggtagacatggctcactcttgg 
aatgaattatgggtaaatcgaatggaggaaggaaacccaaggctctggtatgctgccttg 
ctgtcctttacaagcctcttttacatcctctccatcgtctttgccgcgctgctctacgtc 
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ttttacaccaagcctaacgactgcacagaaaacaaggtcttcatcagcctcaacctgatt 
ttttgtgttgcagtttcatattgtgtccatcctccgtaaagttcaggaacatcagcctcg 
ctctggcctcctgcagtcctccatcattactaaga 



TABLE 23: SEQUENCE OF G9 (SEQ ID NO: 29) 

acgcggggatctaggaggctgctgacctccaacaggaatttcaccacttaaccctctaga 
agtcccactacttaatacttcagtacttctagcatcaggtgtttcaattacatg^gctca 
tcatagccttatagaaggtaaacgaaaccacataaatcaagccctactaattaccattat 
10 actaggactttacttcaccatcctccaagcttcagaatactttgaaacatcattctccat 
ttcagatggtatctatggttctacattcttcatggctactggattccatggactccatgt 
aattattggatcaacattccttattgtttgcctactacgacaactaaaat.ttcacttcac 
atcaaaacatcacttcggatttgaagccgcagcatgatactgacattttgtagacgtaat . 
cttgacttttcctatacgtctccatttattgatgaggatcttaaaaaaaaaaaaaaaaaa ; 
. 15 aaaaaaaaaaaaaaaaaaaaagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

TABLE 24: SEQUENCE OF G16 (SEQ" ID NO:; 30) 
• • .acgcggggnagttgcggccctgtgcgcttccacagcgtcccagcctttgtcctggccbca 
. gctcctgcttcctgacctgcttgagttccagtcctgacttcctttggcgataaacagcag 

20. tatggaaatacctcacagtggatgttcctggccatggaaaggtgctggtggacattgcat 
atggtggggcattttatgcgtttgttagtgcagaaaaattaggacttgatgtgtgttctg 
cgaagacgagggaccttgtggatgcggcaagtgcattgacaggagcagtgaaagcacagt 
ttaaaatcaaccatcctgagagtgaagaccttggttttttgtatggatccatcttgacgg 
atggaaaagatgcttatagtgaggaggccaccaccaacatctgcgtgtttgccgatgaac • 

25 aggttgacagaagccccaccggctcgggagtgacagccagaattgctctgcagtatcata 
aggggctcctgcagctgaaccagaccagagccttcaaaagcagcgcaactggctcggtgt - 
tcacaggctgtgccgtgagggaagcaaagtgtggagatttcaaagctgtcatagtggaag 
? ttgcaggacaagcccacttacncagggrcagcaaacttgmcagtggaagatggtgacccc 
ctaagggatggcttttttttcaagkgacttttttttcataagttttaaaggntttgctta 

30 naaagaaatwccycttaaatggggntttutctgnattatgggcnaaccngtntccngtta 
cccttaccgggtaatttttanactttttaaa^agtatgtttagtanactgngtttaaatg 
taaggnnttatacttcctntttgtcacccntttgngcataaatgtngntaatggggcnaa 
canagngccaanaatccggaaacttttnaagccttngtntncaaggcnaaattnaagccc 
ngttncttaatttntgctgnggaaatgataaaacttgtaagttttttttgattgntagnt 

35 tcagctntnanccccccnatagtaaatgttgngngtttgttnttttaatgttccngtt1:g 
ngtaaaataaaggcntcntgtaagnttccaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaancaaaaaaaaaaaaaaaaaaaaaaaaa 

TABLE 25: SEQUENCE OF G24 (SEQ ID NO: 31) 

40 acgcggggagactggggaaa-tcgaaattatttcagcagctgtcttcagtcactgggtgtg 
tcttcctgcacagtggacttgacgggcctggagggaggagaagaaattgaaacctatctc 
atcctctagtttatttacatcattggattcggttcctgtttggaaacttaactggtcctc 
atggcgttttcgggcacagggacactggtggcttccattgtctccaagatgacatcttca 
gcagccatggtcaaagctggaggggcagtctccggcaGcattctagctggttcgaagggg 

45 ctcatcctgttaacacagagtgccctgggctctgcaacatctgcacttggagcaccgaag 
gttggcaccattttatctgggttctctgcctccaccttggctgcttcccccattggagcc 
aaggctgttgtggctgtgcttgggggagccatgacagtagctgctgtaccacctgcgctg 
agtgctgtgggcttcaccgcctcaggaattgcagcctcctctctagcagctaagatgatg 
tccttgtcagctattgctaatgggggtggagtcccagctggtggcctggtggccattttg 

SO cagtctgctggagctgctggactttccgtgccatctactgtcatcgtgggctttgcaggg 
tttgccgttgtggccagtgtcatgaacatctgtgaaagtttttacccttttttcatggga 
tcagaggtggcagacatggcgacaaaggtggcagatatggcgataaaggtggcagacatt 
tcaacaaaaaaggcactgcccaacttgtttaacccagagaaggattaattacccccaggc 
gacaatttcctaacacatattgcctaggtggaaataaattaaaccgagtttttaaaaaaa 

55 aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 



TABLE 26: SEQUENCE OF G28 (SEQ ID' NO: 32) 
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accaccggnaccacagtncagcaacgtcaagnatggacatcctgtcttacatgnaggccg 
gcaggcagccctgagcgccagggggaagtaactgatgcccccaccctacccctaccccct 
gcccatcatgcaagagccgaggaggggctcttcaggaaggaagccacattcccagtcatt 
ctacccccaccccagattctctttcttattacataaaagacaagcctggcacaactgtgt 
5 gtctgaaccactgtggacacgtgacaattgtcccagtgtgtgcatggctacacagccacg 
tatctgcctgcttgaaacccagggatggtccatctgtgtttacggcttggcacaacaccc 
tcatatttttttcagctttctgttccaaatgagcccaaaggaaacacaagttctaggtcc 
. aatggttctgctcaaacctgaacatcattcttggggccagcatctcccacacgcccacac 
tacacaccaccagcctccttcttccttcctgaaggaccctcctgagccccoaagcccatc 

10 . ccacagtgctcctgagaccagccaagacaactgtgagcgcgatggccgtgtaccttgccc 
gggcggccgtcgaaagggcgaattccagccacttggcggccgtactagtggatccgagct 
cggtccaacttgatgcatacttggantnttntatagtggcacctaaatagcttggngtaa 
tntgggnatagctgnttctgngngaaatggtnttccgttanantccnacaacnacaaccc . 
griacataagtgtaaanctggggngcnangangncnnactanttantnnntngnctnntgc 

1 5 cnttcannngnaactnnnncnntnntnnantgcaccccggnnggngntnn 

TABLE 27: SEQUENCE OF 646. (SEQ ID NO:33) 

accacggggacaggatttgggacgttcctgatgaccgtgttgctgaaatacctgtgcgca 
gaatatggctggcggaatgctatgttcatccaaggtgctctgtccctgaacctgtgtgtc 

20 tgtggggcgctcatgaggcccctctctcccgagaagttagaaaactgcccagaagcggaa. 
gagccgtgtgctctcccagcttactccactgaatctgttaagtctgggggaccactgggc 
atggctgaagaacaggacagaaggcccgggaatgaggagatggtgtgtgaccttcaaaca 
caggagtgtcaaggtcaaacccatccgaggaagaatgtgtgtgccttccgggttctgaag^ 
acagtgagccagctcactgtgcaagtccggaggggctttagggactggcactcaggctat ^ 

25 tttgggacggcttcactcttcaccaaccgcatgtttgtagcctttattttctgggctctg /. 
• ttcgcatacagcagctttgtcatccetttcatccacctgcccgaaatcgtcagtttgtat . 
aacttgtcngaaccaaaatgacacattccctcttgacctcgattatagcaatacttcaca 
tcttcgggaaggtgatctttgggggccgtggnccacctcccttgtatcancgtctggaat 
ggtcttcctnatnngttacttnaccctcgtcctnnnatnttnctttttgcctttgatgcn 

30 cacctccccaanntggntnnctttgggccctaatgggntttctagcggttttntcctnnn 
cc.tggggnacnangactgntngcntnnsLCcttngcnatgctacgnntntattnnccangg 
ntttnncttntngancccntnnngaggatttncnccca 

TABLE 28: SEQUENCE OF F16 (SEQ ID NO: 34) 

35 acttgctggtggtccatccctgcactatttccaagccaggtcttgggatcagtgacttgc 
ttcctgcagtgctgtggtcacaggcccctccagtcagccatgtctagcagcttggcaggc 
ttaggggctcggtgtgcagatttctttttgtgcagcagaatgccaagagactgagcagag 
agtcttcttaaaacctaccctaagggccgccaggtccatgtgcccgtggtaggtctgggg 
ccaggcagctgctgctgtcctgatcccccctcccttagctatggatccttgtagtagagg 
.40. tcagccgct cctggcgcaaacaggatccagtagctgtctgccacccacgaQgtt tctaag 
ataccatgtggagacagggtcctccccatccttttcagaaatagcagcttgttttctctg 
ctggcatgtgtccagggcabcaggtcccagcagggggaggcagactgttaactgaacscg . 
cataggaacaggaggcagcgagagcctgtgggaggaagggccttctagtgaccctgcttc 
tccatcctgcctccctcctgtccacccagtttcagagttgggaggggaggagccatggct 

45 ccctcacccaggcttggtggctgtggccctgttttagagccatggcagcacaggctggtg 
agtggtagagttgt 

In the above Tables, ^"N" denotes ^^unknown". 
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Comments on Clones and Cogna-be Dat:€Jdase Sequences 

In these comment,. ^^Full-length'' means that it includes 
complete coding sequence 

5 

B3 clone is not full-length • It is similar to a DNA 
sequence in the database but does not match any protein in 
the database (no protein is given for the DNA sequence) . 

10 B4S clone is full-length as are the mouse and human 

homologs. B45 is- an exact match to the database proteins 
whereas F4 and F6 have a one amino acid difference. - • 

B46 clone is possibly full-length. 
15 BAB22782 hypothetical protein - putative [Mus 

musculus] is full-length. 

XP_087215 similar to unnamed protein product [Homo ' 
sapiens] is full-length. 

B2ffi70909 unnamed protein product [Homo sapiens] ^is 
20 full-length. 

Ci22 clone is not full-length. Similarity is to 3' non- 
coding region of AP260579S2 Mus musculus disabled-2 p96 
(Dab2) gene^ exons 2-15 that encodes protein AAG44669 

25 disabled-2 p96 [Mus musculus] (full-length) . This protein 
is similar to AAH03064 disabled (Drosophila) homolog 2 
(mitogen-responsive phosphoprotein) [Homo sapiens] , P98082 
Disabled homolog 2 (Differentially expressed protein 2) 
(DOC- 2) [Homo sapiens] , and AAF23161. disabled-2 [Homo 

30 sapiens] (all full-length) . 

PI clone is not full-length. 

AAP23120 SON protein [Mus musculus] is full-length. 
NP_003094 SON DNA binding protein; SON DNA-binding 
35 protein; SON DNA-binding protein, KIZ^IOIS; NRE-binding 

protein [Homo sapiens] are full-length. 
AAK07692 NREBP (negative regulatory element-binding 
protein) [Homo sapiens] is full-length. 
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F2 clone is not full-length. 

NP_476513 DNA segment, Chr 3, ERATO Doi 330, 
expressed; far upstream element (FUSE) binding protein 
1 [Mus musculus] is full-length. 

AAB14763 Similar to far upstream element (FUSE) 
binding protein 1 [Mus musculus] is full-length. 
NP_003893 far upstream element-binding protein; far 
upstream element binding protein; FUSB-binding protein . 
[Homo sapiens] is full-length. 

AAA17976 FUSE* binding protein [Homo sapiens] is full- 
length. 

A53184 myc far upstream element -binding protein - . 
human - is full-length. 

AAH17247 far upstream element (FUSE) binding protein 1- 
[Homo sapiens] is full-length.* 

P4 and P6 clones are full-length. 

NP_034724 kidney androgen regulated protein [Mus 
musculus] is full-length. 

Several other mouse matches are full-length, 
AAa08576 kidney androgen regulated protein [Homo 
sapiens] is full-length. 

AA650272 FKSG22 [Homo sapiens] is full-length. 

P5 clone is possibly full-length. 

NP_076367 claudin 10; claudin-10 [Mus musculus] is 
full-length- 

BAB32005 and XP_127876 putative - similar to CLAUDIN- 
10 [Mus musculus] are full-length. 

BAB71030 unnamed protein product [Homo sapiens] is 
full-length . 

Note 1: Although, the human protein is ^^unnamed", it is 
very similar to the mouse Claudin-10 listed above. 
Note 2: The first 73 amino acids of these Claudin-10 
proteins (including Clone F5) differ significantly from 
other proteins that are also named Claudin-10. 
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P16 clone encodes an unknown protein. The only significant 
similarity is to a genomic nucleotide sequence of Mus 
musculus. It is not known if P16 is full-length. 

P21 clone encodes an unknown protein. A small portion of 
nucleotide sequence is similar to a mouse clone encoding an 
unknown protein. It is not known if .P21 is full-length. 

F27 clone is probably not full-length. Only a portion of 
its protein sequence is similar to Mus musculus and Homo, 
sapiens Heat shock 105 kD protein (both of which are full- 
length) . The rest of the P27 sequence is novel. 

P38 clone is possibly full-length. 

AM>02811 palmitylated serine/threonine kinase [Mus 
musculus] is probably full-length. 

BAA89662 F5-2 [Mus musculus] is probably full-length. 
NP_^035624 and others serine/threonine kinase 16 [Mus 
musculus] is probably f uli-length . 
NP 003682 and others serine/threonine kinase 16; 
protein expressed in day 12 fetal liver [Homo sapiens] 
is full-length. 

CA2^06700 PKL12 protein [Homo sapiens] is full-length. 

P39 clone is probably not full-length. Only a portion of 
its protein sequence is similar to Mus musculus' and Homo 
sapiens Phosphotriesterase Related Protein (both of which 
are full-length) . The rest of the P39 sequence is novel - 

*F40 clone is not full-length. 

AAH11295 Similar to tumor differentially expressed 1 
[Mus musculus] is full-length. 

AAH22901 Unknown [Mus musculus] is full-length. 
NP_036162 tumor differentially expressed 1 [Mus 
musculus] is full-length. 

A2U354420 membrane protein TMS-1 [Mus musculus] is 
full-length. 
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AAB48858 Diff33 gene product [Homo sapiens] is full- 
length. 

AAD34641 transmembrane protein SBBI99 [Homo sapiens] 
is full-length. 

NP_006802 tumor differentially expressed 1; placental 
transmembane protein [Homo sapiens] is full-length. 

69 clone is not full-length. 

CAA24Q90 cytochrome oxidase III {Mus musculus] is 
full-length. 

AAK17824 and AAI<54598 cytochrome c oxidase subunit 
III [Homo sapiens] are full-length. 

G16 clone is not full-length. 

BAB31217 evidence zNAS-'hypothetical protein-'putative 
[Mus musculus] is full-length. 

AAB04753 similar to RIKEN cDNA 2810055F11 gene [Mus ' 
musculus] is full-length. 

AM12131 similar to RIKEN cDNA 2810055F11 gene [Homo 
sapiens] is full-length. 

624 clone is full-length. 

BAB22924 homolog to ERG2 protein ^ putative [Mus 

musculus] is full-length. 

AAB22800 Unknown [Homo sapiens] is not full-length. 
NP_114425 and AA635730 TLH29 protein precursor [Homo 
sapiens] appear to be full-length. 

626 clone is not full-length. 

NP_032320 and others hydroxysteroid dehydrogenase-4 ^ 
delta<5>-3-beta; 3-beta-hydroxysteroid 

dehydrogenase/delta-5-delta-4 isomerase [Mus musculus] 
are full-length. 

NP__000853 hydroxy-delta-5-steroid dehydrogenase, 3 
beta- and steroid delta-isomerase 1; Hydroxy-delta-5- 
steroid dehydrogenase, 3 beta- and steroid [Homo 
sapiens] is full-length. 
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628 clone is probably not full-length. 

BAB21943 and BAB23686 glutathione peroxidase 3 
putative [Mus musculus] are probably not full-length. 
XP_087620 siiftilar to glutathione peroxidase 3 (plasma) 
[Homo sapiens] may be full-length. 

HI Clone is not full-length. Similarity is to 3' non-coding 
region of U96635 Mus musculus ubiquitin protein ligase 
Nedd-4 mRNA that encodes protein AAB63360 NEDD-4 [Mus 
musculus] (full-length) . ' This protein is similar to P46934 
NEDD-4 protein [Homo sapiens] and..BAA07655 KIAA0093 gene 
product is related to NEDD-4 protein [Homo sapiens] (full- 
length) . 

• H8 clone is nearly full-length. 

NP_038505"" ATPase, H+ transporting, lysosomal 38kDa, VO 
subunit D isoform 1; ATPase ,H+ transporting, lysosomal 
(vacuolar proton p\amp) , 42 kDa [Mus musculus] is full- 
length. 

XP_088368 similar to Ac39/physophilin [Homo sapiens] 
is full-length. 
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CLAIMS 

1. A method of protecting a human subject from kidney 
damage which con^rises administering to the subject a 
protective amount of an agent which is 

(1) a polypeptide which is substantially structurally 
identical or. conservatively identical in s.eqilence to a 
reference protein selected from the group consisting of 

.(androgen related protein) NP_034724 and P15267; 

(SON protein) NP_064357, AAF23121, NP_003094, and AAK07692; 

(FOSE binding protein 1) NP_476513, NP_003893, and AAAn976; 

(claudin 10) BAB32005, NP_067361, NP_076367, XP_127876, 
NP_008915, and BAB710301; 

(heat shock protein) BAA11035 and XP_036357; 

(phosphotriesterase related protein) NP_032987 and 
NP_109589; 

(ubiquitin protein ligase Nedd-4) AAB63360 , P46934, and 
BAA07655; and 

(Ac39/physophilin) NP_0385 and XP_08836; 
or 

(2) an expression vector encoding the polypeptide of (1) 
above and expressible in a human cell, under conditions 
conducive to expression of the polypeptide of (1); 

where said agent protects said subject from kidney damage. 
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2. A method of protecting a human subject from kidney damage 
which comprises administering to the subject a protective 
amount of an agent which is 

5 (1) an antagonist of a polypeptide, occurring in said 

subject, which is substantially structurally identical or 
conservatively identical in sequence to a reference protein 
selected from .the group consisting of 

* . 

10 (disabled-2 p96) AAG44669., AAH03064, P98082, and AAF23161; 

(palmitylated serine/threonine kinase) AAD02811, BAA89662, 
NP__035624, NP_003682, and CAA06706; 

(tumor differentially expressed 1 protein) AAH11295, 
AAH22901, NO_036162, AAD54420, AAD22448/ AAB48858, AAD54'420, 
NP_006802, AAB48858, and AAD34641; • 

(cytochrome oxidase III) 07^24090, AAK17824, and AAL54598; 

(TLH39 protein precursor) BAB22924, AAH22800, NP_114425, and 
AAG335730; 

(hydroxysteroid dehydrogenase 4 delta <5>-3 beta) NP_032320, 
NP_000853, and AAA51831; . and 

(glutathione peroxidase III) BAB21943, BAB23686/ and 
XP_087 620; or 

30 (2) an anti-sense vector which inhibits expression of said 
polypeptide in said subject, 

where said agent protects said subject from kidney damage 
mediated by said polypeptide. 

35 

3. A method of protecting a human subject from kidney 
damage which comprises administering to the subject a 
protective amount of an agent which 
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(1) 

(a) down-regulates expression of an ^'unfavorable" 
protein which is identifiable as a homologue in said subject 
of a mouse marker gene which is 

• » 

(i) up-regulated in a first group of mice, which 
are experiencing or are prone to kidney damage / and/ or 

(ii) down-regulated- in a .second group of mice^ 
which are protected against or otherwise less prone to 
kidney damage, relative tdr the other group, or 

(b) is an . antagonist for the expression product of the 
''^unfavorable" gene, or 

(c) degrades, that product, 

or 
(2.) 

(a) up-regulates expression of a ^^favorable" protein 
which is identifiable as a homologue in said subject of a 
mouse marker gene which is . . 

(i) down-regulated in said first group of mice 

and/or 

(ii) up-regulated in said second group of mice, 
relative to the other group or 

(b) is an agonist for the favorable protein, or 

(c) inhibits the degradation of the favorable protein, 

or 

(d) is said favorable protein or a protein which is 
substantially or conservatively identical thereto', or 
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(e) is an expression vector comprising a DNA sequence 
encoding said protein (d) and operably linked to a promoter 
whereby said protein (d) is expressed in cells of said 
subject which are transformed by said vector^ . 
where said agent protects said subject from kidney damage. 

4. A method of screening .for human subjects who have 
developed^ or are prone* to development of, kidney damage, 
which comprises assaying tissue or body fluid samples from 
said subjects to determine the level of expression of a 
^"favorable" human marker gene, said human marker gene being 
substantially structurally identical or conservatively 
identical in sequence to a protein selected from the group 
consisting of 

(androgen related protein) NP_034724 and P15267; 

(SON protein) NP_064357, AAF23121, NP_003094, and AAK07692; 

(FUSE binding protein 1) NP_476513, NP_003893, and AAA17976; 

(claudin 10) BAB32005, NP_067361, NP_0T6367, XP_127876, 
NP_008915, and BAB710301; 

(heat shock protein) BAA11035 and XP_036357; 

(phosphotriesterase related protein) NP_032987 and 
NP_109589; 

(ubiquitin protein ligase Nedd-4) AAB63360 , P46934, and 
BAA07 655; and 

(Ac39/physophilin) NP_0385 and XP_08836; 

and directly correlating the level of expression of said 
marker gene with the development of kidney damage in said 
patient • 
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5. A method of screening for human subjects who have 
developed, or are prone to development of, kidney damage, 
which comprises assaying tissue or body fluid samples from 
said subjects to determine the level of expression of an 
^"unfavorable" human marker gene,, said human marker gene 
being substantially structurally identical or conservatively 
identical in sequence to a protein selected from the group 
consisting of 

(disabled-2 p96) AAG44669, AAH03b64, P98082/ and AAF23.161; 

. (palmitylated serine/threonine kinase) 2^002811, BAA89662, 
NP__0?5624, NP_003682, and CAAD6700; . 

(tumor differentially expressed 1 protein) AAH11295, 
AAH22901, NO_036162, AAD54420, 2^022448, AAB48858, AAD54420, 
NP__006802, AAB48858, and AAD34641; 

(cytochrome oxidase III) CAA24090, AAK17824, and AAL54598; 

(TLH39 protein precursor) BAB22924, AAH22800, NP_114425, and 
AAG335730; 

(hydroxysteroid dehydrogenase 4 delta <5>-3 beta) NP_032320, 
NP_000853, and AAA51831; and 

(glutathione peroxidase III) BAB21943, BAB23686, and 
XP_08 7 62 Gr- 
and inversely correlating the level of expression of said 
marker gene with the development of kidney damage in said 
patient . 

6. A method of screening for human subjects who have 
developed, or are prone to development of, kidney damage, 
which comprises 
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assaying tissue or body fluid samples from said 
subjects to determine the level of expression of at least 
one human marker protein, where said human marker protein is 
identifiable as a homologue of a mouse marker gene which is 
expressed at different levels in a first group of mice who 
are experiencing or are prone to kidney damage and in ,a 
second group of mice protected against or otherwise less 
prone to kidney damage, and 

correlating said level of expression of said human 
marker gene with- the development of kidney damage in said- 
subject . 

7. The method of claims 4 or 6 in which the marker . 
gene is one whose expression is down-regulated in the first 
group of mice. 

8. The method of claims 4 or 6 in which the marker 
gene is one whose expression is up-regulated in the second 
group of mice and/or down-regulated in the first group of 
mice . 

9. The method of claims 4 or 6 in which the first 
group of mice are treated with streptozotocin, 

.10. The method of claim 9 in which the second group of 
mice are treated with streptozotocin, but are genetically 
modified mice protected from diabetes-induced kidney damage 
by virtue of their genetic modification. 

11. The method of claim 10 in which the genetically 
modified mice are transgenic mice which express a growth 
hormone (GH) mutant which. is a GH receptor antagonist which 
antagonizes a growth hormone endogenously expressed by said 
mice. 

12- The method of claim 10 in which the genetically 
modified mice are mice which do not express a functional 
growth hormone receptor binding protein (GHRBP) . 

13. The method of claim 9 in which the second group of 
mice are nontransgenic mice which are not treated with 
streptozotocin . 

14. The method of any one of claims 4-6 in which the 
level of expression of the marker protein is ascertained by 
measuring the level of the corresponding messenger RNA. 
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15. The method of any one of claims 4-6 in which the 
level of expression is ascertained by measuring the level of 
a protein encoded by said marker gene. 

• 16. The .method of any one of claims 1-8, 14-15 in 
which the subjects are diabetic and the kidney damage is, at 
•least in part, diabetes-induced. 

17. The method of any one of claims 1-8, 14,15 in which 
the subjects are hyperinsulinemic . 

18. The methpd of claims 4 or 6 in which the first 
group of mice exhibit kidney damage. 

19. The method of claims 4 or 6 in which the first 
group of mice are genetically modified to overproduce growth 
hormone . 

20. The method- of claim 18 in which the first group of 
mice produce a heterologous growth hormone in addition to 
mouse growth hormone-; " — 

22. The method of claim 18 in which the second group 

of mice are normal mice . 

23. The method of any one of .claims 1-3 in which the 
agent is a DNA, or is encodable by a DNA, which specifically 
hybridizes to the recited DNA strand of any of SEQ ID NOs: 
1-34, or to the complementary strand thereof. 

24. The method of any one of claims 1, 2, 4 or 5 in 
which said polypeptide is at least 80% identical or at least 
highly conservatively identical to said reference protein. 

25. The method of any one of claims 1, 2, 4 , 5 or 24 
in which said polypeptide i6 at least 90% identical to said 
reference protein. 

26. The method of any one of claims 1, 2, 4 or 5 in 
which said polypeptide is identical to said reference 
protein. 

27. The method of claims 4 or 6 in which said homologue 
is identifiable by a BLASTN or BLASTX search conducted, 
using any of SEQ ID NOS: 1-34 as a query sequence, on the 
NCBI Entrez sequence database (s), on or before the fili^ig 
date of the instant application, and the E value calculated 
by BLASTX or BLASTN for the alignment of that homologue, or 
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cDNA encoding that homologue, to the query sequence is less 
than e-10. 

28. The method of claim 27 in which the E value 
calculated by BLASTN* or BLASTX would be less than e-15, more 
preferably less than e-20, still more preferably less than 
e-40, even more preferably less than e-60, considerably more 
preferably less than e-80, and most preferably less than e- 
100. 
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